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NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS I 

Cross-References to Related Applications 

This application is a continuation-in-part of U.S. provisional patent application serial 
5 no. 60/068,755, filed December 23, 1997, and of U.S. provisional patent application serial 
no. 60/080,664, filed April 3, 1998, and of U.S. provisional patent application serial no. 
60/105,234, filed October 21, 1998, each of which applications are incorporated herein by 
reference. 

10 Field of the Invention 

The present invention relates to novel polynucleotides, particularly to novel 
polynucleotides of human origin that are expressed in a selected cell type, are differentially 
expressed in one cell type relative to another cell type {e.g., in cancerous cells, or in cells of a 
specific tissue origin) and/or share homology to polynucleotides encoding a gene product 

15 having an identified functional domain and/or activity. 

Background of the Invention 

Identification of novel polynucleotides, particularly those that encode an expressed 
gene product, is important in the advancement of drug discovery, diagnostic technologies, 

20 and the understanding of the progression and nature of complex diseases such as cancer. 
Identification of genes expressed in different cell types isolated from sources that differ in 
disease state or stage, developmental stage, exposure to various environmental factors, the 
tissue of origin, the species from which the tissue was isolated, and the like is key to 
identifying the genetic factors that are responsible for the phenotypes associated with these 

25 various differences 

This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel 
polynucleotides. 

30 Summary of the Invention 

This invention relates to novel human polynucleotides and variants thereof, their 
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides 

1 
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and to proteins expressed by the genes. The invention also relates to diagnostic and 
therapeutic agents employing such novel human polynucleotides, their corresponding genes 
or gene products, e.g., these genes and proteins, including probes, antisense constructs, and 
antibodies. 



polynucleotides, the library comprising the sequence information of at least one of SEQ ID 
NOS: 1-844. In related aspects, the invention features a library provided on a nucleic acid 
array, or in a computer-readable format. 

In one embodiment, the library is comprises a differentially expressed polynucleotide 

1 0 comprising a sequence selected from the group consisting of SEQ ID NOS:9, 39, 42, 52, 62, 
74, 119, 172, 317, and 379. In specific related embodiments, the library comprises: 1) a 
polynucleotide that is differentially expressed in a human breast cancer cell, where the 
polynucleotide comprises a sequence selected from the group consisting of SEQ ID NOS: 4, 
9,39, 42, 52,62, 65,66, 68, 74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214, 

15 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388; 2) a polynucleotide differentially 
expressed in a human colon cancer cell, where the polynucleotide comprises a sequence 
selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 
288, 3 1 7, 357, 362, and 374; or 3) a polynucleotide differentially expressed in a human lung 
cancer cell, where the polynucleotide comprises a sequence selected from the group 

20 consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 1 19, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 379, 395, 381, and 400. 



comprising a nucleotide sequence having at least 90% sequence identity to an identifying 

sequence of SEQ ID NOS: 1-844 or a degenerate variant thereof. In related aspects, the 
25 invention features recombinant host cells and vectors comprising the polynucleotides of the 

invention, as well as isolated polypeptides encoded by the polynucleotides of the invention 

and antibodies that specifically bind such polypeptides. 

In one embodiment, the invention features an isolated polynucleotide comprising a 

sequence encoding a polypeptide of a protein family selected from the group consisting of: 
30 4 transmembrane segments integral membrane proteins, 7 transmembrane receptors, 

ATPases associated with various cellular activities (AAA), eukaryotic aspartyl proteases, 



5 



Accordingly, in one embodiment, the present invention features a library of 



In another aspect, the invention features an isolated polynucleotide 
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GATA family of transcription factors, G-protein alpha subunit, phorbol esters/diacylglycerol 
binding proteins, protein kinase, protein phosphatase 2C, protein tyrosine phosphatase, 
trypsin, wnt family of developmental signaling proteins, and WW/rsp5/WWP domain 
containing proteins. In a specific related embodiment, the invention features a 
5 polynucleotide comprising a sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305, 
315,341,63, 116, 134, 136, 151,384, 404, 308,213,367, 188,251,202,315,367, 397,256, 
382, 169, 23, 291, 324, 330, 341, 353, 188, 379 , and 395. 

In another embodiment, the invention features a polynucleotide comprising a 
sequence encoding a polypeptide having a functional domain selected from the group 
10 consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 

bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 
zinc finger (CCHC class), and zinc-binding metal loprotease domain. In a specific related 
embodiment, the invention features a polynucleotide comprising a sequence of one of SEQ 
ID NOS: 116, 251, 374, 97, 136, 242, 379, 306, 386, 18, 335, 61, 306, 386, 322, 306, and 



In another aspect, the invention features a method of detecting differentially 
expressed genes correlated with a cancerous state of a mammalian cell, where the method 
comprises the step of detecting at least one differentially expressed gene product in a test 
sample derived from a cell suspected of being cancerous, where the gene product is encoded 

20 by a gene corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 

65, 66, 68,74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214,219, 223,258,298, 
317,338,379,384,386,388, 1,39, 52, 97, 119, 134, 172, 176, 241,288,317,357, 362,374, 
9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323,349, 361,369, 371,379,395,381, 
and 400. Detection of the differentially expressed gene product is correlated with a 

25 cancerous state of the cell from which the test sample was derived. In one embodiment, the 
detecting is by hybridization of the test sample to a reference array, wherein the reference 
array comprises an identifying sequence of at least one of SEQ ID NOS: 1-844. 

In one embodiment of the method of the invention, the cell is a breast tissue derived 
cell, and the differentially expressed gene product is encoded by a gene corresponding to a 

30 sequence of at least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 1 14, 123, 



15 



395. 
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144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, 
and 388. 

In another embodiment of the method of the invention, the cell is a colon tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
5 a sequence of at least one of SEQ ID NOS: 1,39,52,97, 119, 134, 172, 176, 241,288,317, 
357, 362, and 374. 

In yet another embodiment of the method of the invention, the cell is a lung tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
a sequence of at least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 1 19, 135, 154, 160, 260, 
10 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

Other aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 



15 The invention relates to polynucleotides comprising the disclosed nucleotide 

sequences, to full length cDNA, mRNA and genes corresponding to these sequences, and to 
polypeptides and proteins encoded by these polynucleotides and genes. 

Also included are polynucleotides that encode polypeptides and proteins encoded by 
the polynucleotides of the Sequence Listing. The various polynucleotides that can encode 

20 these polypeptides and proteins differ because of the degeneracy of the genetic code, in that 
most amino acids are encoded by more than one triplet codon. The identity of such codons 
is well-known in this art, and this information can be used for the construction of the 
polynucleotides within the scope of the invention. 



25 polypeptides and proteins encoded by the polynucleotides and related cDNA and genes are 
also within the scope of the invention. The variants differ from wild type protein in having 
one or more amino acid substitutions that either enhance, add, or diminish a biological 
activity of the wild type protein. Once the amino acid change is selected, a polynucleotide 
encoding that variant is constructed according to the invention. 

30 The following detailed description describes the polynucleotide compositions 

encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding a 
full-length gene product, expression of these polynucleotides and genes, identification of 



Detailed Description of the Invention 



Polynucleotides encoding polypeptides and proteins that are variants of the 
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structural motifs of the polynucleotides and genes, identification of the function of a gene 
product encoded by a gene corresponding to a polynucleotide of the invention, use of the 
provided polynucleotides as probes and in mapping and in tissue profiling, use of the 
corresponding polypeptides and other gene products to raise antibodies, and use of the 
5 polynucleotides and their encoded gene products for therapeutic and diagnostic purposes. 

L Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not 
necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID 

10 NOS: 1-844; polynucleotides obtained from the biological materials described herein or other 
biological sources (particularly human sources) by hybridization under stringent conditions 
(particularly conditions of high stringency); genes corresponding to the provided 
polynucleotides; variants of the provided polynucleotides and their corresponding genes, 
particularly those variants that retain a biological activity of the encoded gene product (e.g., 

1 5 a biological activity ascribed to a gene product corresponding to the provided 

polynucleotides as a result of the assignment of the gene product to a protein family(ies) 
and/or identification of a functional domain present in the gene product). Other nucleic acid 
compositions contemplated by and within the scope of the present invention will be readily 
apparent to one of ordinary skill in the art when provided with the disclosure here. 

20 The invention features polynucleotides that are expressed in cells of human tissue, 

specifically human colon, breast, and/or lung tissue. Novel nucleic acid compositions of the 
invention of particular interest comprise a sequence set forth in any one of SEQ ID NOS:l- 
844 or an identifying sequence thereof. An "identifying sequence" is a contiguous sequence 
of residues at least about 10 nt to about 20 nt in length, usually at least about 50 nt to about 

25 100 nt in length, that uniquely identifies a polynucleotide sequence, e.g., exhibits less than 
90%, usually less than about 80% to about 85% sequence identity to any contiguous 
nucleotide sequence of more than about 20 nt. Thus, the subject novel nucleic acid 
compositions include full length cDNAs or mRNAs that encompass an identifying sequence 
of contiguous nucleotides from any one of SEQ ID NOS: 1-844. 

30 The polynucleotides of the invention also include polynucleotides having sequence 

similarity or sequence identity. Nucleic acids having sequence similarity are detected by 
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hybridization under low stringency conditions, for example, at 50°C and 10XSSC (0.9 M 
saline/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C in 
1XSSC. Sequence identity can be determined by hybridization under stringent conditions, 
for example, at 50°C or higher and 0.1XSSC (9 mM saline/0.9 mM sodium citrate). 
5 Hybridization methods and conditions are well known in the art, see, e.g., U.S. Patent No. 
5,707,829. Nucleic acids that are substantially identical to the provided polynucleotide 
sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the 
provided polynucleotide sequences (SEQ ID NOS: 1-844) under stringent hybridization 
conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate 

10 homologous or related genes. The source of homologous genes can be any species, e.g. 

primate species, particularly human; rodents, such as rats and mice, canines, felines, bovines, 
ovines, equines, yeast, nematodes, etc. 

Preferably, hybridization is performed using at least 15 contiguous nucleotides of at 
least one of SEQ ID NOS: 1-844. That is, when at least 15 contiguous nucleotides of one of 

1 5 the disclosed SEQ ID NOs. is used as a probe, the probe will preferentially hybridize with a 
gene or mRNA (of the biological material) comprising the complementary sequence, 
allowing the identification and retrieval of the nucleic acids of the biological material that 
uniquely hybridize to the selected probe. Probes from more than one SEQ ID NO. will 
hybridize with the same gene or mRNA if the cDNA from which they were derived 

20 corresponds to one mRNA. Probes of more than 15 nucleotides can be used, but 15 
nucleotides represents enough sequence for unique identification. 

The polynucleotides of the invention also include naturally occurring variants of the 
nucleotide sequences (e.g., degenerate variants, allelic variants, etc.). Variants of the 
polynucleotides of the invention are identified by hybridization of putative variants with 

25 nucleotide sequences disclosed herein, preferably by hybridization under stringent conditions 
For example, by using appropriate wash conditions, variants of the polynucleotides of the 
invention can be identified where the allelic variant exhibits at most about 25-30% base pair 
mismatches relative to the selected polynucleotide probe. In general, allelic variants contain 
15-25% base pair mismatches, and can contain as little as even 5-15%, or 2-5%, or 1-2% 

30 base pair mismatches, as well as a single base-pair mismatch. 
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The invention also encompasses homologs corresponding to the polynucleotides of 
SEQ ID NOS: 1 -844, where the source of homologous genes can be any mammalian species, 
e.g., primate species, particularly human; rodents, such as rats, canines, felines, bovines, 
ovines, equines, yeast, nematodes, etc. Between mammalian species, e.g., human and 
5 mouse, homologs have substantial sequence similarity, e.g., at least 75% sequence identity, 
usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence 
similarity is calculated based on a reference sequence, which may be a subset of a larger 
sequence, such as a conserved motif, coding region, flanking region, etc. A reference 
sequence will usually be at least about 1 8 contiguous nt long, more usually at least about 30 

10 nt long, and may extend to the complete sequence that is being compared. Algorithms for 
sequence analysis are known in the art, such as BLAST, described in Altschul et al., J. Mol 
Biol (1990) 275:403-10. 

In general, variants of the invention have a sequence identity greater than at least 
about 65%, preferably at least about 75%, more preferably at least about 85%, and can be 

15 greater than at least about 90% or more as determined by the Smith- Waterman homology 
search algorithm as implemented in MPSRCH program (Oxford Molecular). For the 
purposes of this invention, a preferred method of calculating percent identity is the Smith- 
Waterman algorithm, using the following. Global DNA sequence identity must be greater 
than 65% as determined by the Smith-Waterman homology search algorithm as implemented 

20 in MPSRCH program (Oxford Molecular) using an affine gap search with the following 
search parameters: gap open penalty, 12; and gap extension penalty, 1 . 

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments 
thereof, particularly fragments that encode a biologically active gene product and/or are 
useful in the methods disclosed herein {e.g., in diagnosis, as a unique identifier of a 

25 differentially expressed gene of interest, etc.). The term "cDNA" as used herein is intended 
to include all nucleic acids that share the arrangement of sequence elements found in native 
mature mRNA species, where sequence elements are exons and 3 and 5 non-coding 
regions. Normally mRNA species have contiguous exons, with the intervening introns, 
when present, being removed by nuclear RNA splicing, to create a continuous open reading 

30 frame encoding a polypeptide of the invention. 
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A genomic sequence of interest comprises the nucleic acid present between the 
initiation codon and the stop codon, as defined in the listed sequences, including all of the 
introns that are normally present in a native chromosome. It can further include the 3 and 5 
untranslated regions found in the mature mRNA. It can further include specific 
5 transcriptional and translational regulatory sequences, such as promoters, enhancers, etc., 

including about 1 kb, but possibly more, of flanking genomic DNA at either the 5 and 3 end 
of the transcribed region. The genomic DNA can be isolated as a fragment of 100 kbp or 
smaller; and substantially free of flanking chromosomal sequence. The genomic DNA 
flanking the coding region, either 3 and 5 , or internal regulatory sequences as sometimes 

10 found in introns, contains sequences required for proper tissue, stage-specific, or disease- 
state specific expression. 

The nucleic acid compositions of the subject invention can encode all or a part of the 
subject differentially expressed polypeptides. Double or single stranded fragments can be 
obtained from the DNA sequence by chemically synthesizing oligonucleotides in accordance 

15 with conventional methods, by restriction enzyme digestion, by PCR amplification, etc. 
Isolated polynucleotides and polynucleotide fragments of the invention comprise at least 
about 10, about 15, about 20, about 35, about 50, about 100, about 150 to about 200 ? about 
250 to about 300, or about 350 contiguous nucleotides selected from the polynucleotide 
sequences as shown in SEQ ID NOS: 1 -844. For the most part, fragments will be of at least 
20 1 5 nt, usually at least 1 8 nt or 25 nt, and up to at least about 50 contiguous nt in length or 
more. In a preferred embodiment, the polynucleotide molecules comprise a contiguous 
sequence of at least twelve nucleotides selected from the group consisting of the 
polynucleotides shown in SEQ ID NOS: 1-844. 



25 polynucleotide sequences disclosed in SEQ ID NOS: 1-844. The probes are preferably at 
least about 12, 15, 16, 18, 20, 22, 24, or 25 nucleotide fragment of a corresponding 
contiguous sequence of SEQ ID NOS: 1-844, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb in 
length. The probes can be synthesized chemically or can be generated from longer 
polynucleotides using restriction enzymes. The probes can be labeled, for example, with a 

30 radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based upon an 
identifying sequence of a polynucleotide of one of SEQ ID NOS: 1 -844. More preferably, 



Probes specific to the polynucleotides of the invention can be generated using the 
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probes are designed based on a contiguous sequence of one of the subject polynucleotides 
that remain unmasked following application of a masking program for masking low 
complexity (e.g., XBLAST) to the sequence., i.e., one would select an unmasked region, as 
indicated by the polynucleotides outside the poly-n stretches of the masked sequence 
5 produced by the masking program. 

The polynucleotides of the subject invention are isolated and obtained in substantial 
purity, generally as other than an intact chromosome. Usually, the polynucleotides, either as 
DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic acid 
sequences, generally being at least about 50%, usually at least about 90% pure and are 

1 0 typically "recombinant", e.g. , flanked by one or more nucleotides with which it is not 
normally associated on a naturally occurring chromosome. 

The polynucleotides of the invention can be provided as a linear molecule or within a 
circular molecule. They can be provided within autonomously replicating molecules 
(vectors) or within molecules without replication sequences. They can be regulated by their 

1 5 own or by other regulatory sequences, as is known in the art. The polynucleotides of the 
invention can be introduced into suitable host cells using a variety of techniques which are 
available in the art, such as transferrin polycation-mediated DNA transfer, transfection with 
naked or encapsulated nucleic acids, liposome-mediated DNA transfer, intracellular 
transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation, 

20 gene gun, calcium phosphate-mediated transfection, and the like. 

The subject nucleic acid compositions can be used to, for example, produce 
polypeptides, as probes for the detection of mRNA of the invention in biological samples 
(e.g., extracts of human cells) to generate additional copies of the polynucleotides, to 
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes or as 

25 triple-strand forming oligonucleotides. The probes described herein can be used to, for 
example, determine the presence or absence of the polynucleotide sequences as shown in 
SEQ ID NOS: 1 -844 or variants thereof in a sample. These and other uses are described in 
more detail below. 
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Use of Polynucleotides to Obtain Full-Length cDNA and Full-Length Human Gene and 
Promoter Region 

Full-length cDNA molecules comprising the disclosed polynucleotides are obtained 
as follows. A polynucleotide having a sequence of one of SEQ ID NOS: 1-844, or a portion 
5 thereof comprising at least 12, 1 5, 18, or 20 nucleotides, is used as a hybridization probe to 
detect hybridizing members of a cDNA library using probe design methods, cloning 
methods, and clone selection techniques such as those described in U.S. Patent No. 
5,654,173. Libraries of cDNA are made from selected tissues, such as normal or tumor 
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent. 

10 Preferably, the tissue is the same as the tissue from which the polynucleotides of the 
invention were isolated, as both the polynucleotides described herein and the cDNA 
represent expressed genes. Most preferably, the cDNA library is made from the biological 
material described herein in the Examples. Alternatively, many cDNA libraries are available 
commercially. (Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed, (1989) 

1 5 Cold Spring Harbor Press, Cold Spring Harbor, NY). The choice of cell type for library 

construction can be made after the identity of the protein encoded by the gene corresponding 
to the polynucleotide of the invention is known. This will indicate which tissue and cell 
types are likely to express the related gene, and thus represent a suitable source for the 
mRNA for generating the cDNA. Where the provided polynucleotides are isolated from 

20 cDNA libraries, the libraries are prepared from mRNA of human colon cells, more 

preferably, human colon cancer cells, even more preferably, from a highly metastatic colon 
cell, Kml2L4-A. 

Techniques for producing and probing nucleic acid sequence libraries are described, 
for example, in Sambrook et al, Molecular Cloning: A Laboratory Manual 2nd Ed. , (1989) 
25 Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using 

primers based on sequence from SEQ ID NOS: 1-844. In one embodiment, the cDNA library 
can be made from only poly-adenylated mRNA. Thus, poly-T primers can be used to 
prepare cDNA from the mRNA. 



30 preferably that encompass the complete coding sequence of the native message, are obtained. 
In order to confirm that the entire cDNA has been obtained, RNA protection experiments 



Members of the library that are larger than the provided polynucleotides, and 
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are performed as follows. Hybridization of a full-length cDNA to an mRNA will protect the 
RNA from RNase degradation. If the cDNA is not full length, then the portions of the 
mRNA that are not hybridized will be subject to RNase degradation. This is assayed, as is 
known in the art, by changes in electrophoretic mobility on polyacrylamide gels, or by 
5 detection of released monoribonucleotides. Sambrook et aL, Molecular Cloning: A 

Laboratory Manual 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. In 
order to obtain additional sequences 5' to the end of a partial cDNA, 5* RACE (PCR 
Protocols: A Guide to Methods and Applications, (1990) Academic Press, Inc.) is performed. 



1 0 the isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions 

thereof, are used as probes to libraries of genomic DNA. Preferably, the library is obtained 
from the cell type that was used to generate the polynucleotides of the invention, but this is 
not essential. Most preferably, the genomic DNA is obtained from the biological material 
described herein in the Examples. Such libraries can be in vectors suitable for carrying large 

1 5 segments of a genome, such as PI or YAC, as described in detail in Sambrook et aL, 9.4- 
9.30. In addition, genomic sequences can be isolated from human BAC libraries, which are 
commercially available from Research Genetics, Inc., Huntville, Alabama, USA, for 
example. In order to obtain additional 5' or 3' sequences, chromosome walking is performed, 
as described in Sambrook et aL, such that adjacent and overlapping fragments of genomic 

20 DNA are isolated. These are mapped and pieced together, as is known in the art, using 
restriction digestion enzymes and DNA ligase. 

Using the polynucleotide sequences of the invention, corresponding full-length genes 
can be isolated using both classical and PCR methods to construct and probe cDNA libraries. 
Using either method, Northern blots, preferably, are performed on a number of cell types to 

25 determine which cell lines express the gene of interest at the highest level. Classical 

methods of constructing cDNA libraries are taught in Sambrook et aL, supra. With these 
methods, cDNA can be produced from mRNA and inserted into viral or expression vectors. 
Typically, libraries of mRNA comprising poly(A) tails can be produced with poly(T) 
primers. Similarly, cDNA libraries can be produced using the instant sequences as primers. 

30 PCR methods are used to amplify the members of a cDNA library that comprise the 

desired insert. In this case, the desired insert will contain sequence from the full length 



Genomic DNA is isolated using the provided polynucleotides in a manner similar to 
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cDNA that corresponds to the instant polynucleotides. Such PCR methods include gene 

trapping and RACE methods. Gene trapping entails inserting a member of a cDNA library 

into a vector. The vector then is denatured to produce single stranded molecules. Next, a 

substrate-bound probe, such a biotinylated oligo, is used to trap cDNA inserts of interest. 

5 Biotinylated probes can be linked to an avidin-bound solid substrate. PCR methods can be 

used to amplify the trapped cDNA. To trap sequences corresponding to the full length 

genes, the labeled probe sequence is based on the polynucleotide sequences of the invention. 

Random primers or primers specific to the library vector can be used to amplify the trapped 

cDNA. Such gene trapping techniques are described in Gruber et al, WO 95/04745 and 

10 Gruber et al, U.S. Pat. No. 5,500,356. Kits are commercially available to perform gene 

trapping experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA. 

"Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying 
cDNAs from a number of different RNAs. The cDNAs are ligated to an oligonucleotide 
linker, and amplified by PCR using two primers. One primer is based on sequence from the 

15 instant polynucleotides, for which full length sequence is desired, and a second primer 

comprises sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. A 
description of this methods is reported in WO 97/191 1 0. In preferred embodiments of 
RACE, a common primer is designed to anneal to an arbitrary adaptor sequence ligated to 
cDNA ends (Apte and Siebert, Biotechniques (1993) 75:890-893; Edwards et aL 9 Nuc. Acids 

20 Res. (1991) 79:5227-5232). When a single gene-specific RACE primer is paired with the 
common primer, preferential amplification of sequences between the single gene specific 
primer and the common primer occurs. Commercial cDNA pools modified for use in RACE 
are available. 

Another PCR-based method generates full-length cDNA library with anchored ends 
25 without needing specific knowledge of the cDNA sequence. The method uses lock-docking 
primers (I- VI), where one primer, poly TV (I-III) locks over the polyA tail of eukaryotic 
mRNA producing first strand synthesis and a second primer, polyGH (IV-VI) locks onto the 
polyC tail added by terminal deoxynucleotidyl transferase (TdT). This method is described 
in WO 96/40998. 

30 The promoter region of a gene generally is located 5' to the initiation site for RNA 

polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as 
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TATTA or TATAA, which is sensitive to mutations. The promoter region can be obtained 
by performing 5' RACE using a primer from the coding region of the gene. Alternatively, 
the cDNA can be used as a probe for the genomic sequence, and the region 5' to the coding 
region is identified by "walking up." If the gene is highly expressed or differentially 
5 expressed, the promoter from the gene can be of use in a regulatory construct for a 
heterologous gene. 

Once the full-length cDNA or gene is obtained, DNA encoding variants can be 
prepared by site-directed mutagenesis, described in detail in Sambrook et aL, 15.3-15.63. 
The choice of codon or nucleotide to be replaced can be based on disclosure herein on 

10 optional changes in amino acids to achieve altered protein structure and/or function. 

As an alternative method to obtaining DNA or RNA from a biological material, 
nucleic acid comprising nucleotides having the sequence of one or more polynucleotides of 
the invention can be synthesized. Thus, the invention encompasses nucleic acid molecules 
ranging in length from 15 nucleotides (corresponding to at least 15 contiguous nucleotides of 

15 one of SEQ ID NOS: 1 -844) up to a maximum length suitable for one or more biological 
manipulations, including replication and expression, of the nucleic acid molecule. The 
invention includes but is not limited to (a) nucleic acid having the size of a full gene, and 
comprising at least one of SEQ ID NOS: 1-844; (b) the nucleic acid of (a) also comprising at 
least one additional gene, operably linked to permit expression of a fusion protein; (c) an 

20 expression vector comprising (a) or (b); (d) a plasmid comprising (a) or (b) ; and (e) a 

recombinant viral particle comprising (a) or (b). Once provided with the polynucleotides 
disclosed herein, construction or preparation of (a) - (e) are well within the skill in the art. 

The sequence of a nucleic acid comprising at least 15 contiguous nucleotides of at 
least any one of SEQ ID NOS: 1-844, preferably the entire sequence of at least any one of 

25 SEQ ID NOS: 1 -844, is not limited and can be any sequence of A, T, G, and/or C (for DNA) 
and A, U, G, and/or C (for RNA) or modified bases thereof, including inosine and 
pseudouridine. The choice of sequence will depend on the desired function and can be 
dictated by coding regions desired, the intron-like regions desired, and the regulatory regions 
desired. Where the entire sequence of any one of SEQ ID NOS: 1-844 is within the nucleic 

30 acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the 
sequence of any one of SEQ ID NOS: 1-844. 
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II- Expression of Polypeptide Encoded by Full-Length cDNA or Full-Length Gene 

The provided polynucleotide (e.g., a polynucleotide having a sequence of one of SEQ 
ID NOS: 1-844), the corresponding cDNA, or the full-length gene is used to express a partial 
5 or complete gene product. 

Constructs of polynucleotides having sequences of SEQ ID NOS: 1-844 can be 
generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid 
from large numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et al., Gene 
(Amsterdam) (1995) 164(1) A9-5Z. In this method, assembly PCR (the synthesis of long 
10 DNA sequences from large numbers of oligodeoxyribonucleotides (oligos)) is described. 
The method is derived from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and 
does not rely on DNA ligase, but instead relies on DNA polymerase to build increasingly 
longer DNA fragments during the assembly process. For example, a 1.1 -kb fragment 
containing the TEM-1 beta-lactamase-encoding gene (bla) can be assembled in a single 
1 5 reaction from a total of 56 oligos, each 40 nucleotides (nt) in length. The synthetic gene can 
be PCR amplified and cloned in a vector containing the tetracycline-resistance gene (Tc-R) 
as the sole selectable marker. Without relying on ampicillin (Ap) selection, 76% of the Tc-R 
colonies were Ap-R, making this approach a general method for the rapid and cost-effective 
synthesis of any gene. 

20 Appropriate polynucleotide constructs are purified using standard recombinant DNA 

techniques as described in, for example, Sambrook et aL, Molecular Cloning: A Laboratory 
Manual 2nd Ed, (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under 
current regulations described in United States Dept. of HHS, National Institute of Health 
(NIH) Guidelines for Recombinant DNA Research. The gene product encoded by a 

25 polynucleotide of the invention is expressed in any expression system, including, for 

example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable vectors and 
host cells are described in U.S. Patent No. 5,654,173. 

Bacteria. Expression systems in bacteria include those described in Chang et al, 
Nature (1978) 275:615; Goeddel et al, Nature (1979) 281:544; Goeddel et al, Nucleic Acids 



30 Res. (1980) 5:4057; EP 0 036,776; U.S. Patent No. 4,551,433; DeBoere/ a/., Proc. Natl. 
Acad Sci. (USA) (1983) 50:21-25; and Siebenlist et al, Cell (1980) 20:269. 
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Yeast. Expression systems in yeast include those described in Hinnen et ah, Proc. 
Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al.,J. Bacteriol. (1983) 153:163; Kurtz et al., 
Mol. Cell. Biol. (1986) 6": 142; Kunze et al., J. Basic Microbiol. (1985) 25:141; Gleeson et 
al.,J. Gen. Microbiol. (1986) 752:3459; Roggenkamp et al., Mol. Gen. Genet. (1986) 
5 202:302; Das et al., J. Bacteriol. (1 984) 158: 1 1 65; De Louvencourt et al., J. Bacteriol. 
(1983) 154:131; Van den Berg et al, Bio/Technology (1990) 5:135; Kunze et al., J. Basic 
Microbiol. (1985) 25:141; Cregg et al., Mol. Cell. Biol. (1985) 5:3376; U.S. Patent Nos. 
4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:106; Davidow et al., Curr. 
Genet. (1985) 70:380; Gaillardin et al, Curr. Genet. (1985) 70:49; Ballance et al, Biochem. 

10 Biophys. Res. Commun. (1983) 772:284-289; Tilburn et al, Gene (1983) 26:205-221; Yelton 
et al, Proc. Natl. Acad. Sci. (USA) (1984) 57:1470-1474; Kelly and Hynes, EMBOJ. (1985) 
4:475479; EP 0 244,234; and WO 91/00357. 

Insect Cells. Expression of heterologous genes in insects is accomplished as 
described in U.S. Patent No. 4,745,051 ; Friesen et al, "The Regulation of Baculovirus Gene 

1 5 Expression", in: The Molecular Biology Of Baculoviruses (1 986) (W. Doerfler, ed.); EP 0 
127,839; EP 0 155,476; and Vlak et al, J. Gen. Virol. (1988) 69:165-116; Miller et al, Ann. 
Rev. Microbiol. (1988) 42:111; Carbonell et al, Gene (1988) 75:409; Maeda et al, Nature 
(1985) 575:592-594; Lebacq-Verheyden et al, Mol. Cell. Biol. (1988) 5:3129; Smith et al, 
Proc. Natl. Acad. Sci. (USA) (1985) 52:8844; Miyajima et al, Gene (1987) 55:273; and 

20 Martin et al, DNA (1988) 7:99. Numerous baculoviral strains and variants and 

corresponding permissive insect host cells from hosts are described in Luckow et al, 
Bio/Technology (1988) 6:47-55, Miller et al, Generic Engineering (1986) 5:277-279, and 
Maeda et al, Nature (1985) 575:592-594. 



25 et al, EMBOJ. (1985) 4:161, Gorman et al, Proc. Natl. Acad. Sci. (USA) (1982) 79:6111, 
Boshart et al, Cell (1985) 47:521 and U.S. Patent No. 4,399,216. Other features of 
mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. (1979) 
55:44, Barnes and Sato, Anal. Biochem. (1980) 702:255, U.S. Patent Nos. 4,767,704, 
4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 30,985. 

30 Polynucleotide molecules comprising a polynucleotide sequence provided herein 

propagated by placing the molecule in a vector. Viral and non-viral vectors are used, 



Mammalian Cells. Mammalian expression is accomplished as described in Dijkema 



15 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 




PCT/US98/27610 



including plasmids. The choice of plasmid will depend on the type of cell in which 
propagation is desired and the purpose of propagation. Certain vectors are useful for 
amplifying and making large amounts of the desired DNA sequence. Other vectors are 
suitable for expression in cells in culture. Still other vectors are suitable for transfer and 
5 expression in cells in a whole animal or person. The choice of appropriate vector is well 
within the skill of the art. Many such vectors are available commercially. The partial or 
full-length polynucleotide is inserted into a vector typically by means of DNA ligase 
attachment to a cleaved restriction enzyme site in the vector. Alternatively, the desired 
nucleotide sequence can be inserted by homologous recombination in vivo. Typically this is 
10 accomplished by attaching regions of homology to the vector on the flanks of the desired 
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by 
polymerase chain reaction using primers comprising both the region of homology and a 
portion of the desired nucleotide sequence, for example. 



15 length polynucleotides are linked to regulatory sequences as appropriate to obtain the desired 
expression properties. These can include promoters (attached either at the 5' end of the sense 
strand or at the 3' end of the antisense strand), enhancers, terminators, operators, repressors, 
and inducers. The promoters can be regulated or constitutive. In some situations it may be 
desirable to use conditionally active promoters, such as tissue-specific or developmental 

20 stage-specific promoters. These are linked to the desired nucleotide sequence using the 

techniques described above for linkage to vectors. Any techniques known in the art can be 
■ used. 

When any of the above host cells, or other appropriate host cells or organisms, are 
used to replicate and/or express the polynucleotides or nucleic acids of the invention, the 
25 resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope 
of the invention as a product of the host cell or organism. The product is recovered by any 
appropriate means known in the art. 

Once the gene corresponding to a selected polynucleotide is identified, its expression 
can be regulated in the cell to which the gene is native. For example, an endogenous gene of 
30 a cell can be regulated by an exogenous regulatory sequence as disclosed in U.S. Patent No. 
5,641,670. 



The polynucleotides set forth in SEQ ID NOS: 1-844 or their corresponding full- 
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III. Identification of Functional and Structural Motifs of Novel Genes 

A. Screening Polynucleotide Sequences and Amino Acid Sequences Against 
Publicly Available Databases 



full genes can be aligned with individual known sequences. Similarity with individual 
sequences can be used to determine the activity of the polypeptides encoded by the 
polynucleotides of the invention. For example, sequences that show similarity with a 
chemokine sequence can exhibit chemokine activities. Also, sequences exhibiting similarity 

10 with more than one individual sequence can exhibit activities that are characteristic of either 
or both individual sequences. 

The full length sequences and fragments of the polynucleotide sequences of the 
nearest neighbors can be used as probes and primers to identify and isolate the full length 
sequence corresponding to provided polynucleotides. The nearest neighbors can indicate a 

1 5 tissue or cell type to be used to construct a library for the full-length sequences 
corresponding to the provided polynucleotides.. 

Typically, a selected polynucleotide is translated in all six frames to determine the 
best alignment with the individual sequences. The sequences disclosed herein in the 
Sequence Listing are in a 5' to 3' orientation and translation in three frames can be sufficient 

20 (with a few specific exceptions as described in the Examples). These amino acid sequences 
are referred to, generally, as query sequences, which will be aligned with the individual 
sequences. Databases with individual sequences are described in "Computer Methods for 
Macromolecular Sequence Analysis" Methods in Enzymology (1996) 266, Doolittle, 
Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. 

25 Databases include Genbank, EMBL, and DNA Database of Japan (DDBJ). 

Query and individual sequences can be aligned using the methods and computer 
programs described above, and include BLAST, available over the world wide web at 
http://ww.ncbi.nlm.nih.gov/BLAST/ . Another alignment algorithm is Fasta, available in the 
Genetics Computing Group (GCG) package, Madison, Wisconsin, USA, a wholly owned 

30 subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment are described in 
Doolittle, supra. Preferably, an alignment program that permits gaps in the sequence is 



5 



Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or 
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utilized to align the sequences. The Smith- Waterman is one type of algorithm that permits 

gaps in sequence alignments. See Meth. Mol Biol (1997) 70: 173-187. Also, the GAP 
program using the Needleman and Wunsch alignment method can be utilized to align 
sequences. An alternative search strategy uses MPSRCH software, which runs on a 
5 MASPAR computer. MPSRCH uses a Smith- Waterman algorithm to score sequences on a 
massively parallel computer. This approach improves ability to identify sequences that are 
distantly related matches, and is especially tolerant of small gaps and nucleotide sequence 
errors. Amino acid sequences encoded by the provided polynucleotides can be used to 
search both protein and DNA databases. 
10 Results of individual and query sequence alignments can be divided into three 

categories, high similarity, weak similarity, and no similarity. Individual alignment results 
ranging from high similarity to weak similarity provide a basis for determining polypeptide 
activity and/or structure. Parameters for categorizing individual results include: percentage 
of the alignment region length where the strongest alignment is found, percent sequence 
1 5 identity, and p value. 

The percentage of the alignment region length is calculated by counting the number 
of residues of the individual sequence found in the region of strongest alignment, e.g., 
contiguous region of the individual sequence that contains the greatest number of residues 
that are identical to the residues of the corresponding region of the aligned query sequence. 
20 This number is divided by the total residue length of the query sequence to calculate a 

percentage. For example, a query sequence of 20 amino acid residues might be aligned with 
a 20 amino acid region of an individual sequence. The individual sequence might be 
identical to amino acid residues 5, 9-15, and 17-19 of the query sequence. The region of 
strongest alignment is thus the region stretching from residue 9-19, an 1 1 amino acid stretch. 
25 The percentage of the alignment region length is: 1 1 (length of the region of strongest 
alignment) divided by (query sequence length) 20 or 55%. 

Percent sequence identity is calculated by counting the number of amino acid 
matches between the query and individual sequence and dividing total number of matches by 
the number of residues of the individual sequences found in the region of strongest 
30 alignment. Thus, the percent identity in the example above would be 10 matches divided by 
1 1 amino acids, or approximately, 90.9% 
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P value is the probability that the alignment was produced by chance. For a single 
alignment, the p value can be calculated according to Karlin et ah, Proc. Natl. Acad. Sci. 
(1990) 57:2264 and Karlin et aL, Proc, Natl Acad Sci. (1993) 90. The p value of multiple 
alignments using the same query sequence can be calculated using an heuristic approach 
5 described in Altschul et ah, Nat. Genet. (1994) 6:1 19. Alignment programs such as BLAST 
program can calculate the p value. 

Another factor to consider for determining identity or similarity is the location of the 
similarity or identity. Strong local alignment can indicate similarity even if the length of 
alignment is short. Sequence identity scattered throughout the length of the query sequence 

10 also can indicate a similarity between the query and profile sequences. The boundaries of 
the region where the sequences align can be determined according to Doolittle, supra; 
BLAST or FAST programs; or by determining the area where sequence identity is highest. 

High Similarity. In general, in alignment results considered to be of high similarity, 
the percent of the alignment region length is typically at least about 55% of total length 

1 5 query sequence; more typically, at least about 58%; even more typically; at least about 60% 
of the total residue length of the query sequence. Usually, percent length of the alignment 
region can be as much as about 62%; more usually, as much as about 64%; even more 
usually, as much as about 66%. Further, for high similarity, the region of alignment, 
typically, exhibits at least about 75% of sequence identity; more typically, at least about 

20 78%; even more typically; at least about 80% sequence identity. Usually, percent sequence 
identity can be as much as about 82%; more usually, as much as about 84%; even more 
usually, as much as about 86%. 

The p value is used in conjunction with these methods. If high similarity is found, 
the query sequence is considered to have high similarity with a profile sequence when the p 

25 value is less than or equal to about 10" 2 ; more usually; less than or equal to about 10 3 ; even 
more usually; less than or equal to about 10" 4 . More typically, the p value is no more than 
about 10" 5 ; more typically; no more than or equal to about 10" 10 ; even more typically; no 
more than or equal to about 10" 15 for the query sequence to be considered high similarity. 



30 similarity, there is no minimum percent length of the alignment region nor minimum length 
of alignment. A better showing of weak similarity is considered when the region of 



Weak Similarity. In general, where alignment results considered to be of weak 
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alignment is, typically, at least about 15 amino acid residues in length; more typically, at 
least about 20; even more typically; at least about 25 amino acid residues in length. Usually, 
length of the alignment region can be as much as about 30 amino acid residues; more 
usually, as much as about 40; even more usually, as much as about 60 amino acid residues. 
5 Further, for weak similarity, the region of alignment, typically, exhibits at least about 35% of 
sequence identity; more typically, at least about 40%; even more typically; at least about 
45% sequence identity. Usually, percent sequence identity can be as much as about 50%; 
more usually, as much as about 55%; even more usually, as much as about 60%. 



10 with a profile sequence when the p value is usually less than or equal to about 10" 2 ; more 
usually; less than or equal to about 10" 3 ; even more usually; less than or equal to about 10" 4 . 
More typically, the p value is no more than about 10" 5 ; more usually; no more than or equal 
to about 10 10 ; even more usually; no more than or equal to about 10" 15 for the query sequence 
to be considered weak similarity. 

1 5 Similarity Determined by Sequence Identity Alone. Sequence identity alone can be 

used to determine similarity of a query sequence to an individual sequence and can indicate 
the activity of the sequence. Such an alignment, preferably, permits gaps to align sequences. 
Typically, the query sequence is related to the profile sequence if the sequence identity over 
the entire query sequence is at least about 15%; more typically, at least about 20%; even 

20 more typically, at least about 25%; even more typically, at least about 50%. Sequence 

identity alone as a measure of similarity is most useful when the query sequence is usually, 
at least 80 residues in length; more usually, 90 residues; even more usually, at least 95 amino 
acid residues in length. More typically, similarity can be concluded based on sequence 
identity alone when the query sequence is preferably 100 residues in length; more preferably, 

25 120 residues in length; even more preferably, 150 amino acid residues in length. 

Determining Activity from Alignments with Profile and Multiple Aligned Sequences. 
Translations of the provided polynucleotides can be aligned with amino acid profiles that 
define either protein families or common motifs. Also, translations of the provided 
polynucleotides can be aligned to multiple sequence alignments (MSA) comprising the 

30 polypeptide sequences of members of protein families or motifs. Similarity or identity with 
profile sequences or MSAs can be used to determine the activity of the gene products (e.g., 



If low similarity is found, the query sequence is considered to have weak similarity 
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polypeptides) encoded by the provided polynucleotides or corresponding cDNA or genes. 
For example, sequences that show an identity or similarity with a chemokine profile or MSA 
can exhibit chemokine activities. 

Profiles can designed manually by (1) creating an MSA, which is an alignment of the 
5 amino acid sequence of members that belong to the family and (2) constructing a statistical 
representation of the alignment. Such methods are described, for example, in Birney et al, 
Nucl. Acid Res. (1996) 24(14): 2730-2739. MSAs of some protein families and motifs are 
publicly available. For example, http : //genome. wustl. edu/Pfam/ includes MSAs of 547 
different families and motifs. These MSAs are described also in Sonnhammer et al, 

1 0 Proteins ( 1 997) 28: 405-420. Other sources over the world wide web include the site at 
http://wA\Av.embl-heidelberg.de/argos/ali/ali.html ; alternatively, a message can be sent to 
ALI@EMBL-HEIDELBERG.DE for the information. A brief description of these MSAs is 
reported in Pascarella et aL, Prot. Eng. (1996) 9(^:249-251. Techniques for building 
profiles from MSAs are described in Sonnhammer et al, supra; Birney et al. 9 supra; and 

1 5 "Computer Methods for Macromolecular Sequence Analysis," Methods in Enzymology 

(1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, 
California, USA. 

Similarity between a query sequence and a protein family or motif can be determined 
by (a) comparing the query sequence against the profile and/or (b) aligning the query 
20 sequence with the members of the family or motif. Typically, a program such as Searchwise 
is used to compare the query sequence to the statistical representation of the multiple 
alignment, also known as a profile. The program is described in Birney et aL y supra. Other 
techniques to compare the sequence and profile are described in Sonnhammer et al., supra 
and Doolittle, supra, 

25 Next, methods described by Feng et aL, J. Mol. Evol (1987) 25:351 and Higgins et 

al, CABIOS (1989) 5:151 can be used align the query sequence with the members of a 
family or motif, also known as a MSA. Computer programs, such as PILEUP, can be used. 
See Feng et aL, infra. In general, the following factors are used to determine if a similarity 
between a query sequence and a profile or MSA exists: (1) number of conserved residues 

30 found in the query sequence, (2) percentage of conserved residues found in the query 
sequence, (3) number of frameshifts, and (4) spacing between conserved residues. 
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Some alignment programs that both translate and align sequences can make any 
number of frameshifts when translating the nucleotide sequence to produce the best 
alignment. The fewer frameshifts needed to produce an alignment, the stronger the 
similarity or identity between the query and profile or MSAs. For example, a weak 
5 similarity resulting from no frameshifts can be a better indication of activity or structure of a 
query sequence, than a strong similarity resulting from two frameshifts. Preferably, three or 
fewer frameshifts are found in an alignment; more preferably two or fewer frameshifts; even 
more preferably, one or fewer frameshifts; even more preferably, no frameshifts are found in 
an alignment of query and profile or MSAs. 

10 Conserved residues are those amino acids found at a particular position in all or some 

of the family or motif members. For example, most chemokines contain four conserved 
cysteines. Alternatively, a position is considered conserved if only a certain class of amino 
acids is found in a particular position in all or some of the family members. For example, 
the N-terminal position can contain a positively charged amino acid, such as lysine, arginine, 

15 or histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a 

single amino acid is found at a particular position in at least about 40% of all class members; 

more typically, at least about 50%; even more typically, at least about 60% of the members. 

Usually, a residue is conserved when a class or single amino acid is found in at least about 
20 70% of the members of a family or motif; more usually, at least about 80%; even more 

usually, at least about 90%; even more usually, at least about 95%. 

A residue is considered conserved when three unrelated amino acids are found at a 

particular position in the some or all of the members; more usually, two unrelated amino 

acids. These residues are conserved when the unrelated amino acids are found at particular 
25 positions in at least about 40% of all class member; more typically, at least about 50%; even 

more typically, at least about 60% of the members. Usually, a residue is conserved when a 

class or single amino acid is found in at least about 70% of the members of a family or motif; 

more usually, at least about 80%; even more usually, at least about 90%; even more usually, 

at least about 95%. 

30 A query sequence has similarity to a profile or MSA when the query sequence 

comprises at least about 25% of the conserved residues of the profile or MSA; more usually, 
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at least about 30%; even more usually; at least about 40%. Typically, the query sequence 
has a stronger similarity to a profile sequence or MSA when the query sequence comprises at 
least about 45% of the conserved residues of the profile or MSA; more typically, at least 
about 50%; even more typically; at least about 55%. 



Profiles 

The identify and function of the gene that correlates to a polynucleotide described 
herein can be determined by screening the polynucleotides or their corresponding amino acid 



10 motifs among proteins of each family. Publicly available profiles are described above in 

Section IVA. Additional or alternative profiles are described below. 

In comparing a novel polynucleotide with known sequences, several alignment tools 

are available. Examples include PileUp, which creates a multiple sequence alignment, and is 

described in Feng et al, J. Mol Evol (1987) 25:351. Another method, GAP, uses the 
15 alignment method of Needleman et al, J. Mol Biol (1970) 45:443. GAP is best suited for 

global alignment of sequences. A third method, BestFit, functions by inserting gaps to 

maximize the number of matches using the local homology algorithm of Smith et al. 9 Adv. 

Appl Math. (1981) 2:482. Exemplary protein profiles are provided below and in the 

examples. 

20 Chemokines. Chemokines are a family of proteins that have been implicated in 

lymphocyte trafficking, inflammatory diseases, angiogenesis, hematopoiesis, and viral 
infection. See, for example, Rollins, Blood (1997) 90(^:909-928, and Wells et al, J. Leuk. 
Biol (1997) 67:545-550. U.S. Patent No. 5,605,817 discloses DNA encoding a chemokine 
expressed in fetal spleen. U.S. Patent No. 5,656,724 discloses chemokine-like proteins and 

25 methods of use. U.S. Patent No. 5,602,008 discloses DNA encoding a chemokine expressed 
by liver. 

Chemokine mutants are polypeptides having an amino acid sequence that possesses 
at least one amino acid substitution, addition, or deletion as compared to native chemokines. 
Fragments possess the same amino acid sequence of the native chemokines; mutants can 
30 lack the amino and/or carboxyl terminal sequences. Fusions are mutants, fragments, or 

native chemokines that also include amino and/or carboxyl terminal amino acid extensions. 



5 



B. 



Screening Polynucleotide and Amino Acid Sequences Against Protein 



sequences against profiles of protein families. Such profiles focus on common structural 
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The number or type of the amino acid changes is not critical, nor is the length or 
number of the amino acid deletions, or amino acid extensions that are incorporated in the 
chemokines as compared to the native chemokine amino acid sequences. A polynucleotide 
encoding one of these variant polypeptides will retain at least about 80% amino acid identity 
5 with at least one known chemokine. Preferably, these polypeptides will retain at least about 
85% amino acid sequence identity, more preferably, at least about 90%; even more 
preferably, at least about 95%. In addition, the variants exhibit at least 80%; preferably 
about 90%; more preferably about 95% of at least one activity exhibited by a native 
chemokine, which includes immunological, biological, receptor binding, and signal 
10 transduction functions. 

Assays for chemotaxis relating to neutrophils are described in Walz et al, Biochem. 
Biophys. Res. Commun. (1987) 149:155, Yoshimura et al, Proc. Natl. Acad. Sci. (USA) 

(1987) 84:9233, and Schroder et al, J. Immunol (1987) 7 39:3474; to lymphocytes, Larsen et 
al, Science (1989) 243:1464, Carr et al, Proc. Natl Acad. Sci. (USA) (1994) 97:3652; to 

1 5 tumor-infiltrating lymphocytes, Liao et al, J. Exp. Med (1 995). 182: 1 301 ; to hematopoietic 
progenitors, Aiuti et al, J. Exp. Med. (1997) 185:1 1 1; to monocytes, Valente et al, Biochem. 

(1988) 27:4162; and to natural killer cells, Loetscher et al, J. Immunol (1996) 156:322, and 
Allavena et al, Eur. J. Immunol (1994) 2¥:3233. 



20 in Dahinden et al, J. Exp. Med (1994) 779:751, Weber et al, J. Immunol (1995) 154:4166, 
and Noso et al, Biochem. Biophys. Res. Commun. (1994) 200:1470; for attracting dendritic 
cells, Sozzani et al, J. Immunol (1995) 755:3292; for attracting basophils, in Dahinden et 
al, J. Exp, Med (1994) 779:751, Alam et aL, J. Immunol (1994) 752:1298, Alam et al, J. 
Exp. Med. (1992) 776:781; and for activating neutrophils, Maghazaci et al, Eur. J. Immunol 

25 (1996) 26:315, and Taub et aL, J. Immunol (1995) 755:3877. Native chemokines can act as 
mitogens for fibroblasts, assayed as described in Mullenbach et al, J. Biol Chem. (1986) 
267:719. 

Native chemokines exhibit binding activity with a number of receptors. Description 
of such receptors and assays to detect binding are described in, for example, Murphy et al, 
30 Science (1991) 255:1280; Combadiere et al, J. Biol Chem. (1995) 270/29671 ; Daugherty et 
al, J. Exp. Med. (1996) 755:2349; Samson et al, Biochem. (1996) 55:3362; Raport et al, J. 



Assays for determining the biological activity of attracting eosinophils are described 
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Biol Chem. (1996) 277:17161; Combadiere etal,J. Leukoc. Biol (1996) 60:147; Baba et 
al,J. Biol Chem. (1997) 25:14893; Yosida et al, J. Biol Chem. (1997) 272:13803; 
Arvannitakis et al, Nature (1997) 555:347, and other assays are known in the art. 

Assays for kinase activation of chemokines are described by Yen et al, J. Leukoc. 
5 Biol (1997) (57:529; Dubois et al y J. Immunol. (1996) 756:1356; Turner et al, J. Immunol 

(1995) 755:2437. Assays for inhibition of angiogenesis or cell proliferation are described in 
Maione et al, Science (1990) 247:11. Glycosaminoglycan production can be induced by 
native chemokines, assayed as described in Castor et al, Proc. Natl Acad. ScL (USA) (1983) 
80:165. Chemokine-mediated histamine release from basophils is assayed as described in 

10 Dahinden et al, J. Exp. Med. (1989) 770:1787; and White et al, Immunol Lett. (1989) 
22:151. Heparin binding is described in Luster et al, J. Exp. Med. (1995) 752:219. 

Chemokines can possess dimerization activity, which can be assayed according to 
Burrows et al, Biochem. (1994) 55:12741; and Zhang et al y Mol Cell Biol (1995) 75:4851. 
Native chemokines can play a role in the inflammatory response of viruses. This activity 

1 5 can be assayed as described in Bleul et al, Nature (1996) 552:829; and Oberlin et al, Nature 

(1996) 552:833. Exocytosis of monocytes can be promoted by native chemokines. The 
assay for such activity is described in Uguccioni et al, Eur. J. Immunol. (1995) 25:64. 
Native chemokines also can inhibit hematopoietic stem cell proliferation. The method for 
testing for such activity is reported in Graham et al, Nature (1990) 344:442. 

20 Death Domain Proteins. Several protein families contain death domain motifs 

(Feinstein and Kimchi, TIBS Letters (1995) 20:242). Some death domain containing 
proteins are implicated in cytotoxic intracellular signaling (Cleveland et al, Cell (1995) 
57:479, Pan et al, Science (1997) 276:1 1 1 ; Duan et al, Nature (1997) 555:86-89, and 
Chinnaiyan et al, Science (1996) 274:990). U.S. Patent No. 5,563,039 describes a protein 

25 homologous to TRADD (Tumor Necrosis Factor Receptor- 1 Associated Death Domain 
containing protein), and modifications of the active domain of TRADD that retain the 
functional characteristics of the protein, as well as apoptosis assays for testing the function of 
such death domain containing proteins. U.S. Patent No. 5,658,883 discloses biologically 
active TGF-B 1 peptides. U.S. Patent No. 5,674,734 discloses RIP, which contains a C- 

30 terminal death domain and an N-terminal kinase domain. 



25 



BNSDOCID: <WO. 



.9933982A2_L> 




WO 99/33982 



PCT/US98/27610 



Leukemia Inhibitory Factor (LIFV An LIF profile is constructed from sequences of 
leukemia inhibitor factor, CT-1 (cardiotrophin-1), CNTF (ciliary neurotrophic factor), OSM 
(oncostatin M), and IL-6 (interleukin-6). This profile encompasses a family of secreted 
cytokines that have pleiotropic effects on many cell types including hepatocytes, osteoclasts, 
5 neuronal cells and cardiac myocytes, and can be used to detect additional genes encoding 
such proteins. These molecules are all structurally related and share a common co-receptor 
gpl30 which mediates intracellular signal transduction by cytoplasmic tyrosine kinases such 
as src. 



1 0 and to function in the development of a variety of cell types. Thus new members of this 
family would be candidates to be developed as growth or survival factors for the cell types 
that they stimulate. For more details on this family of cytokines, see Pennica et al, Cytokine 
and Growth Factor Reviews (1996) 7:81-91. U.S. Patent No. 5,420,247 discloses LIF 
receptor and fusion proteins. U.S. Patent No. 5,443,825 discloses human LIF. 

15 Angiopoietin. Angiopoietin-1 is a secreted ligand of the TIE-2 tyrosine kinase; it 

functions as an angiogenic factor critical for normal vascular development. Angiopoietin-2 is 
a natural antagonist of angiopoietin-1 and thus functions as an anti-angiogenic factor. These 
two proteins are structurally similar and activate the same receptor (Folkman et al., Cell 
(1996) 87 A 153, and Davis et al, Cell (1996) 57:1 161). The angiopoietin molecules are 

20 composed of two domains: a coiled-coil region and a region related to fibrinogen. The 
fibrinogen domain is found in many molecules including ficolin and tesascin, and is well 
defined structurally with many members. 

Receptor Protein-Tvrosine Kinases. Receptor Protein-Tyrosine Kinases or RPTKs 
are described in Lindberg, Annu. Rev. Cell Biol (1994) 70:251-337. 

25 Growth Factors: (Epidermal Growth Factor) EGF and (Fibroblast Growth Factor) 

FGF. For a discussion of growth factor superfamilies, see Growth Factors: A Practical 
Approach, (Appendix Al) (1993) McKay and Leigh, Oxford University Press, NY, 237-243. 
U.S. Patent No. 4,444,760 discloses acidic brain fibroblast growth factor, which is active in 
the promotion of cell division and wound healing. U.S. Patent No. 5,439,818 discloses DNA 

30 encoding human recombinant basic fibroblast growth factor, which is active in wound 

healing. U.S. Patent No. 5,604,293 discloses recombinant human basic fibroblast growth 



Novel proteins related to this family are also likely to be secreted, to activate gpl30 
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factor, which is useful for wound healing. U.S. Patent No. 5,410,832 discloses brain-derived 
and recombinant acidic fibroblast growth factor, which act as mitogens for mesoderm and 
neuroectoderm-derived cells in culture, and promote wound healing in soft tissue, 
cartilaginous tissue and musculoskeletal tissue. U.S. Patent No. 5387,673 discloses 
5 biologically active fragments of FGF. 

Proteins of the TNF Family. A profile derived from the TNF family is created by 
aligning sequences of the following TNF family members: nerve growth factor (NGF), 
lymphotoxin, Fas ligand, tumor necrosis factor (TNFot), CD40 ligand, TRAIL, ox40 ligand, 
4- IBB ligand, CD27 ligand, and CD30 ligand. The profile is designed to identify sequences 

10 of proteins that constitute new members or homologues of this family of proteins. U.S. 
Patent No. 5,606,023 discloses mutant TNF proteins; U.S. Patent No. 5,597,899 and U.S. 
Patent No. 5,486,463 disclose TNF muteins; and U.S. Patent No. 5,652,353 discloses DNA 
encoding TNFa muteins. 

Members of the TNF family of proteins have been show in vitro to multimerize, as 

1 5 described in Burrows et al, Biochem. (1994) 53:12741 and Zhang et al, Mol Cell Biol 
(1995) 75:4851 and bind receptors as described in Browning et al.,J. Immunol . (1994) 
747:1230, Androlewicz et aL, J. Biol Chem.{\992) 267:2542, and Crowe et al, Science 
(1994) 264:707. 

In vivo, TNFs proteolytically cleave a target protein as described in Kriegel et al, 
20 Cell ( 1 988) 53 AS and Mohler et al , Nature ( 1 994) 3 70:2 1 8 and demonstrate cell 

proliferation and differentiation activity. T-cell or thymocyte proliferation is assayed as 
described in Armitage et al., Eur. J. Immunol. (1992) 22:447; Current Protocols in 
Immunology, ed. J.E. Coligan et al., 3.1-3.19; Takai et al, J. Immunol. (1986) 737:3494- 
3500, Bertagnoli et al, J. Immunol. (1990) 145:1706, Bertagnoli et al,J. Immunol (1991) 
25 133:327, Bertagnoli et al,J. Immunol (1992) 149:377%, and Bowman et al., J. Immunol. 
(1994) 752:1756. B cell proliferation and Ig secretion are assayed as described in 
Maliszewski, J. Immunol. (1990) 144:302%, and Assays for B Cell Function: In Vitro 
Antibody Production , Mond and Brunswick, Current Protocols in Immunol., Coligan Ed vol 
1 pp 3.8.1-3.8.16, John Wiley and Sons, Toronto 1994, Kehrl et al, Science (1987) 255:1 144 
30 and Boussiotis et al, PNAS USA (1994) 91:7007. Other in vivo activities include 

upregulation of cell surface antigens, upregulation of costimulatory molecules, and cellular 
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aggregation/adhesion as described in Barrett et a!., J. Immunol. (1991) 1 46:1722; Bjorck et 
aL, Eur. J. Immunol (1993) 25:1771; Clark et al,Annu Rev. Immunol. (1991) 9:97; 
Ranheim et al,J. Exp. Med. (1994) 777:925; Yellin, J. Immunol. (1994) 153:666; and Gruss 
et al, Blood (1994) 6V:2305. 
5 Proliferation and differentiation of hematopoietic and lymphopoietic cells has also 

been shown in vivo for TNFs, using assays for embryonic differentiation and hematopoiesis 
as described in Johansson et aL, Cellular Biology (1995) 75:141, Keller et al, Mol Cell 
Biol. (1993) 75:473, McClanahan et aL, Blood (1993) 57:2903 and using assays to detect 
stem cell survival and differentiation as described in Culture of Hematopoietic Cells, 

10 Freshney et al eds, pp 1-21, 23-29, 139-162, 163-179, and 265-268, Wiley-Liss, Inc., New 
York, NY, 1994, and Hirajama et al. 9 PNAS USA (1992) SP:5907. 

In vivo activities of TNFs also include lymphocyte survival and apoptosis, assayed as 
described in Darzynkewicz et al, Cytometry (1992) 75:795; Gorczca et al, Leukemia (1993) 
7:659; Itoh et aL, Cell (1991) 66:233; Zacharduk, J. Immunol. (1990) 145:4037; Zamai et 

15 a!., Cytometry (1993) 74:891; and Gorczyca et al, lnt'U. Oncol. (1992) 7:639. Some 
members of the TNF family are cleaved from the cell surface; others remain membrane 
bound. The three-dimensional structure of TNF is discussed in Sprang and Eck, Tumor 
Necrosis Factors; supra. 



20 soluble version, as described in Kriegler et al, Cell (1988) 55:45, Perez et al, Cell (1990) 
65:251, and Shaw et al, Cell (1986) 46:659. The transmembrane domain is between amino 
acid 46 and 77 and the cytoplasmic domain is between position 1 and 45 on the human form 
of TNFa. The 3-dimensional motifs of TNF include a sandwich of two pleated p sheets. 
Each sheet is composed of anti-parallel p strands, p strands facing each other on opposite 

25 sites of the sandwich are connected by short polypeptide loops, as described in Van Ostade et 
al, Protein Engineering (1994) 7(1) :S 9 and Sprang et al, Tumor Necrosis Factors; supra. 
Residues of the TNF family proteins that are involved in the p sheet secondary structure 
have been identified as described in Van Ostade et al, Protein Eng. (1994) 7(1) :5, and 
Sprang et al, supra. 

30 TNF receptors are disclosed in U.S. Patent No. 5,395,760. A profile derived from the 

TNF receptor family is created by aligning sequences of the TNF receptor family, including 



TNF proteins include a transmembrane domain. The protein is cleaved into a shorter 
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Apol/Fas, TNFR I and II, death receptor 3 (DR3), CD40, ox40, CD27, and CD30. Thus, the 
profile is designed to identify from the polynucleotides of the invention sequences of 
proteins that constitute new members or homologues of this family of proteins. 

Tumor necrosis factor receptors exist in two forms in humans: p55 TNFR and p75 
5 TNFR, both of which provide intracellular signals upon binding with a ligand. The 

extracellular domains of these receptor proteins are cysteine rich. The receptors can remain 
membrane bound, although some forms of the receptors are cleaved forming soluble 
receptors. The regulation, diagnostic, prognostic, and therapeutic value of soluble TNF 
receptors is discussed in Aderka, Cytokine and Growth Factor Reviews, (1996) 7(3) :23l. 

10 PDGF Family. U.S. Patent No. 5,326,695 discloses platelet derived growth factor 

agonists; bioactive portions of PDGF-B are used as agonists. U.S. Patent No. 4,845,075 
discloses biologically active B-chain homodimers, and also includes variants and derivatives 
of the PDGF-B chain. U.S. Patent No. 5,128,321 discloses PDGF analogs and methods of 
use. Proteins having the same bioactivity as PDGF are disclosed, including A and B chain 

15 proteins. 

Kinase (Including MKK) Family. U.S. Patent No. 5,650,501 discloses 
serine/threonine kinase, associated with mitotic and meiotic cell division; the protein has a 
kinase domain in its N-terminal and 3 PEST regions in the C-terminus. U.S. Patent No. 
5,605,825 discloses human PAK65, a serine protein kinase. 
20 The foregoing discussion provides a few examples of the protein profiles that can be 

compared with the polynucleotides of the invention. One skilled in the art can use these and 
other protein profiles to identify the genes that correlate with the provided polynucleotides. 



25 particular interest. For example, levels of secreted polypeptides can be assayed in body 

fluids that are convenient, such as blood, urine, prostatic fluid and semen. Membrane-bound 
polypeptides are useful for constructing vaccine antigens or inducing an immune response. 
Such antigens would comprise all or part of the extracellular region of the membrane-bound 
polypeptides. Because both secreted and membrane-bound polypeptides comprise a 

30 fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms can 
be used to identify such polypeptides. 



C. Identification of Secreted & Membrane-Bound Polypeptides 

Both secreted and membrane-bound polypeptides of the present invention are of 
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A signal sequence is usually encoded by both secreted and membrane-bound 

polypeptide genes to direct a polypeptide to the surface of the cell. The signal sequence 

usually comprises a stretch of hydrophobic residues. Such signal sequences can fold into 

helical structures. Membrane-bound polypeptides typically comprise at least one 

transmembrane region that possesses a stretch of hydrophobic amino acids that can 

transverse the membrane. Some transmembrane regions also exhibit a helical structure. 

Hydrophobic fragments within a polypeptide can be identified by using computer 

algorithms. Such algorithms include Hopp & Woods A Proc. Natl. Acad. ScL USA (1981) 

75:3824-3828; Kyte & Doolittle, J. Mol Biol (1982) 157: 105-132; and RAOAR algorithm, 

Degli Esposti et a/., Eur. J. Biochem. (1990) 190: 207-219. 

Another method of identifying secreted and membrane-bound polypeptides is to 

translate the polynucleotides of the invention in all six frames and determine if at least 8 

contiguous hydrophobic amino acids are present. Those translated polypeptides with at least 

8; more typically, 10; even more typically, 12 contiguous hydrophobic amino acids are 

considered to be either a putative secreted or membrane bound polypeptide. Hydrophobic 

amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine, methionine, 

phenylalanine, proline, threonine, tryptophan, tyrosine, and valine. 



IV. Identification of the Function of an Expression Product of a Full-Length Gene 
Corresponding to a Polynucleotide 

Ribozymes, antisense constructs, and dominant negative mutants can be used to 
determine function of the expression product of a gene corresponding to a polynucleotide 
provided herein. These methods and compositions are particularly useful where the provided 
novel polynucleotide exhibits no significant or substantial homology to a sequence encoding 
a gene of known function. Antisense molecules and ribozymes can be constructed from 
synthetic polynucleotides. Typically, the phosphoramidite method of oligonucleotide 
synthesis is used. See Beaucage et aL, Tet. Lett. (1981) 22:1859 and U.S. Patent No. 
4,668,777. Automated devices for synthesis are available to create oligonucleotides using 
this chemistry. Examples of such devices include Biosearch 8600, Models 392 and 394 by 
Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California, USA; and 
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, 
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phosphate analog oligonucleotides, and chemically derivatized oligonucleotides can also be 
produced, and can be covalently attached to other molecules. RNA oligonucleotides can be 
synthesized, for example, using RNA phosphoramidites. This method can be performed on 
an automated synthesizer, such as Applied Biosystems, Models 392 and 394, Foster City, 
5 California, USA. See Applied Biosystems User Bulletin 53 and Ogilvie et al, Pure & 
Applied Chem. (1987) 59:325. 

Phosphorothioate oligonucleotides can also be synthesized for antisense construction. 
A sulfiirizing reagent, such as tetraethylthiruam disulfide (TETD) in acetonitrile can be used 
to convert the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 
1 0 minutes at room temperature. TETD replaces the iodine reagent, while all other reagents 

used for standard phosphoramidite chemistry remain the same. Such a synthesis method can 
be automated using Models 392 and 394 by Applied Biosystems, for example. 

Oligonucleotides of up to 200 nucleotides can be synthesized, more typically, 100 
nucleotides, more typically 50 nucleotides; even more typically 30 to 40 nucleotides. These 
15 synthetic fragments can be annealed and ligated together to construct larger fragments. See, 
for example, Sambrook et aL, supra, 
A. Ribozvmes 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and 

20 the target message must contain a specific nucleotide sequence. They are engineered to 

cleave any RNA species site-specifically in the background of cellular RNA. The cleavage 
event renders the mRNA unstable and prevents protein expression. Importantly, ribozymes 
can be used to inhibit expression of a gene of unknown function for the purpose of 
determining its function in an in vitro or in vivo context, by detecting the phenotypic effect. 

25 One commonly used ribozyme motif is the hammerhead, for which the substrate 

sequence requirements are minimal. Design of the hammerhead ribozyme is disclosed in 
Usman et al, Current Opin. Struct Biol. (1996) 6:527. Usman also discusses the 
therapeutic uses of ribozymes. Ribozymes can also be prepared and used as described in 
Long etal y FASEB J. (1993) 7:25; Symons, Ann. Rev. Biochem. (1992) 67:641; Perrotta et 

30 al, Biochem. (1992) 37:16; Ojwang et al, Proc. Natl. Acad. Sci. (USA) (1992) £9:10802; 
and U.S. Patent No. 5,254,678. Ribozyme cleavage of HIV-I RNA is described in U.S. 
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Patent No. 5,144,019; methods of cleaving RNA using ribozymes is described in U.S. 
Patent No. 5,1 16,742; and methods for increasing the specificity of ribozymes are described 
in U.S. Patent No. 5,225,337 and Koizumi et al., Nucleic Acid Res. (1989) 7 7:7059. 
Preparation and use of ribozyme fragments in a hammerhead structure are also described by 
5 Koizumi et al y Nucleic Acids Res. (1989) 7 7:7059. Preparation and use of ribozyme 

fragments in a hairpin structure are described by Chowrira and Burke, Nucleic Acids Res. 
(1992) 20:2835. Ribozymes can also be made by rolling transcription as described in 
Daubendiek and Kool, Nat. Biotechnol. (1997) 15(3):213. 



10 branched structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 7 7:6959. The 
basic structure of the ribozymes can also be chemically altered in ways familiar to those 
skilled in the art, and chemically synthesized ribozymes can be administered as synthetic 
oligonucleotide derivatives modified by monomeric units. In a therapeutic context, liposome 
mediated delivery of ribozymes improves cellular uptake, as described in Birikh et al, Eur. 

1 5 J. Biochem. ( 1 997) 245:1. 

Using the polynucleotide sequences of the invention and methods known in the art, 
ribozymes are designed to specifically bind and cut the corresponding mRNA species. 
Ribozymes thus provide a means to inhibit the expression of any of the proteins encoded by 
the disclosed polynucleotides or their full-length genes. The full-length gene need not be 

20 known in order to design and use specific inhibitory ribozymes. In the case of a 

polynucleotide or full-length cDNA of unknown function, ribozymes corresponding to that 
nucleotide sequence can be tested in vitro for efficacy in cleaving the target transcript. 
Those ribozymes that effect cleavage in vitro are further tested in vivo. The ribozyme can 
also be used to generate an animal model for a disease, as described in Birikh et al.. supra. 

25 An effective ribozyme is used to determine the function of the gene of interest by blocking 
its transcription and detecting a change in the cell. Where the gene is found to be a mediator 
in a disease, an effective ribozyme is designed and delivered in a gene therapy for blocking 
transcription and expression of the gene. 



30 with knowledge of a portion of the coding sequence of the gene to be inhibited. Thus, for 
many genes, a partial polynucleotide sequence provides adequate sequence for constructing 



The hybridizing region of the ribozyme can be modified or can be prepared as a 



Therapeutic and functional genomic applications of ribozymes proceed beginning 
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an effective ribozyme. A target cleavage site is selected in the target sequence, and a 
ribozyme is constructed based on the 5' and 3' nucleotide sequences that flank the cleavage 
site. Retroviral vectors are engineered to express monomeric and multimeric hammerhead 
ribozymes targeting the mRNA of the target coding sequence. These monomeric and 
5 multimeric ribozymes are tested in vitro for an ability to cleave the target mRNA. A cell line 
is stably transduced with the retroviral vectors expressing the ribozymes, and the 
transduction is confirmed by Northern blot analysis and reverse-transcription polymerase 
chain reaction (RT-PCR). The cells are screened for inactivation of the target mRNA by 
such indicators as reduction of expression of disease markers or reduction of the gene 
1 0 product of the target mRNA. 

B. Antisense 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 

1 5 polynucleotide sequence can interfere with expression of the corresponding gene. Antisense 
polynucleotides are typically generated within the cell by expression from antisense 
constructs that contain the antisense strand as the transcribed strand. Antisense 
polynucleotides based on the disclosed polynucleotides will bind and/or interfere with the 
translation of mRNA comprising a sequence complementary to the antisense polynucleotide. 

20 The expression products of control cells and cells treated with the antisense construct are 

compared to detect the protein product of the gene corresponding to the polynucleotide upon 
which the antisense construct is based. The protein is isolated and identified using routine 
biochemical methods. 



25 corresponding to a disclosed polynucleotide is the biological activity of antisense 

therapeutics. Antisense therapy for a variety of cancers is in clinical phase and has been 
discussed extensively in the literature. Reed reviewed antisense therapy directed at the Bcl-2 
gene in tumors; gene transfer-mediated overexpression of Bcl-2 in tumor cell lines conferred 
resistance to many types of cancer drugs. (Reed, J.C., N.C.I. (1997) 59:988). The potential 

30 for clinical development of antisense inhibitors of ras is discussed by Cowsert, L.M., Anti- 
Cancer Drug Design (1997) 72:359. Additional important antisense targets include 



One rationale for using antisense methods to determine the function of the gene 
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leukemia (Geurtz, A.M., Anti-Cancer Drug Design (1997) 72:341); human C-ref kinase 
(Monia, B.P., Anti-Cancer Drug Design (1997) 7 2:327); and protein kinase C (McGraw et 
al, Anti-Cancer Drug Design (1997) 72:315. 

Given the extensive background literature and clinical experience in antisense 
5 therapy, one skilled in the art can use selected polynucleotides of the invention as additional 
potential therapeutics. The choice of polynucleotide can be narrowed by first testing them 
for binding to "hot spot" regions of the genome of cancerous cells. If a polynucleotide is 
identified as binding to a "hot spot", testing the polynucleotide as an antisense compound in 
the corresponding cancer cells clearly is warranted. 

10 Ogunbiyi et al, Gastroenterology (1997) 113(3):16\ describe prognostic use of 

allelic loss in colon cancer; Barks et al, Genes, Chromosomes, and Cancer (1997) 19(4):27$ 
describe increased chromosome copy number detected by FISH in malignant melanoma; 
Nishizake et aL, Genes, Chromosomes, and Cancer (1997) 19(4)\261 describe genetic 
alterations in primary breast cancer and their metastases and direct comparison using 

1 5 modified comparative genome hybridization; and Elo et aL, Cancer Research (1997) 

57(16):3356 disclose that loss of heterozygosity at 16z24.1-q24.2 is significantly associated 
with metastatic and aggressive behavior of prostate cancer. 



20 polynucleotide disclosed herein, dominant negative mutations are readily generated for 

corresponding proteins that are active as homomul timers. A mutant polypeptide will interact 
with wild-type polypeptides (made from the other allele) and form a non-functional 
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic domain, or a 
cellular localization domain. Preferably, the mutant polypeptide will be overproduced. 

25 Point mutations are made that have such an effect. In addition, fusion of different 

polypeptides of various lengths to the terminus of a protein can yield dominant negative 
mutants. General strategies are available for making dominant negative mutants (see, e.g., 
Herskowitz, Nature (1987) 329:219). Such techniques can be used to create loss of function 
mutations, which are useful for determining protein function. 



C. Dominant Negative Mutations 

As an alternative method for identifying function of the gene corresponding to a 
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V. Construction of Polypeptides of the Invention and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed 
polynucleotides. These polypeptides can also be encoded by nucleic acids that, by virtue of 



5 polynucleotides. Thus, the invention includes within its scope a polypeptide encoded by a 
polynucleotide having the sequence of any one of SEQ ID NOS: 1-844 or a variant thereof. 

In general, the term "polypeptide" as used herein refers to both the full length 
polypeptide encoded by the recited polynucleotide, the polypeptide encoded by the gene 
represented by the recited polynucleotide, as well as portions or fragments thereof. 

10 "Polypeptides" also includes variants of the naturally occurring proteins, where such 

variants are homologous or substantially similar to the naturally occurring protein, and can 
be of an origin of the same or different species as the naturally occurring protein (e.g., 
human, murine, or some other species that naturally expresses the recited polypeptide, 
usually a mammalian species). In general, variant polypeptides have a sequence that has at 

15 least about 80%, usually at least about 90%, and more usually at least about 98% sequence 
identity with a differentially expressed polypeptide of the invention, as measured by BLAST 
using the parameters described above. The variant polypeptides can be naturally or non- 
naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the 
glycosylation pattern found in the corresponding naturally occurring protein. 

20 The invention also encompasses homologs of the disclosed polypeptides (or 

fragments thereof) where the homologs are isolated from other species, i.e. other animal or 
plant species, where such homologs, usually mammalian species, e.g. rodents, such as mice, 
rats; domestic animals, e.g., horse, cow, dog, cat; and humans. By homolog is meant a 
polypeptide having at least about 35%, usually at least about 40% and more usually at least 

25 about 60% amino acid sequence identity a particular differentially expressed protein as 

identified above, where sequence identity is determined using the BLAST algorithm, with 
the parameters described supra. 

In general, the polypeptides of the subject invention are provided in a non-naturally 
occurring environment, e.g. are separated from their naturally occurring environment. In 

30 certain embodiments, the subject protein is present in a composition that is enriched for the 
protein as compared to a control. As such, purified polypeptide is provided, where by 



the degeneracy of the genetic code, are not identical in sequence to the disclosed 
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purified is meant that the protein is present in a composition that is substantially free of non- 
differentially expressed polypeptides, where by substantially free is meant that less than 
90%, usually less than 60% and more usually less than 50% of the composition is made up 
of non-differentially expressed polypeptides. 



mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or 
deletions. The amino acid substitutions can be conservative amino acid substitutions or 
substitutions to eliminate non-essential amino acids, such as to alter a glycosylation site, a 
phosphorylation site or an acetylation site, or to minimize misfolding by substitution or 

10 deletion of one or more cysteine residues that are not necessary for function. Conservative 
amino acid substitutions are those that preserve the general charge, 
hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. For 
example, substitutions between the following groups are conservative: Gly/Ala, Val/Ile/Leu, 
Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys, Thr, and Phe/Trp/Tyr. 

1 5 Variants can be designed so as to retain biological activity of a particular region of 

the protein (e.g. 9 a functional domain and/or, where the polypeptide is a member of a protein 
family, a region associated with a consensus sequence). In a non-limiting example, Osawa et 
aL, Biochem. Mol Int. (1994) 54:1003, discusses the actin binding region of a protein from 
several different species. The actin binding regions of the these species are considered 

20 homologous based on the fact that they have amino acids that fall within "homologous 

residue groups." Homologous residues are judged according to the following groups (using 
single letter amino acid designations): STAG; ILVMF; HRK; DEQN; and FYW. For 
example, and S, a T, an A or a G can be in a position and the function (in this case actin 
binding) is retained. 

25 Additional guidance on amino acid substitution is available from studies of protein 

evolution. Go et al, Int. J. Peptide Protein Res. (1980) 75:21 1, classified amino acid residue 
sites as interior or exterior depending on their accessibility. More frequent substitution on 
exterior sites was confirmed to be general in eight sets of homologous protein families 
regardless of their biological functions and the presence or absence of a prosthetic group. 

30 Virtually all types of amino acid residues had higher mutabilities on the exterior than in the 
interior. No correlation between mutability and polarity was observed of amino acid 



5 



Also within the scope of the invention are variants; variants of polypeptides include 
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residues in the interior and exterior, respectively. Amino acid residues were classified into 
one of three groups depending on their polarity: polar (Arg, Lys, His, Gin, Asn, Asp, and 
Glu); weak polar (Ala ? Pro, Gly, Thr, and Ser), and nonpolar (Cys, Val, Met, He, Leu, Phe, 
Tyr, and Trp). Amino acid replacements during protein evolution were very conservative: 
5 88% and 76% of them in the interior or exterior, respectively, were within the same group of 
the three. Inter-group replacements are such that weak polar residues are replaced more 
often by nonpolar residues in the interior and more often by polar residues on the exterior. 

Additional guidance for production of polypeptide variants is provided in Querol et 
aL, Prot. Eng. (1996) 9:265, which provides general rules for amino acid substitutions to 

10 enhance protein thermostability. New glycosylation sites can be introduced as discussed in 
Olsen and Thomsen, J. Gen. Microbiol (1991) 137:519. An additional disulfide bridge can 
be introduced, as discussed by Perry and Wetzel, Science (1984) 22(5:555; Pantoliano et aL, 
Biochemistry (1987) 26:2077; Matsumura et al, Nature (1989) .342:291; Nishikawa et al, 
Protein Eng. (1990) 3:443; Takagi et aL, J. Biol. Chem. (1990) 265:6874; Clarke et aL, 

1 5 Biochemistry (1 993) 32:4322; and Wakarchuk et aL, Protein Eng. ( 1 994) 7: 1 379. Metal 
binding sites can be introduced, according to Toma et aL, Biochemistry (1991) 30:97, and 
Haezerbrouck et aL, Protein Eng. (1993) (5:643. Substitutions with prolines in loops can be 
made according to Masul et aL, Appl Env. Microbiol. (1994) 60:3579; and Hardy et aL, 
FEBSLett.317:S9. 

20 Cysteine-depleted muteins are considered variants within the scope of the invention. 

These variants can be constructed according to methods disclosed in U.S. Patent No. 
4,959,314, which discloses substitution of cysteines with other amino acids, and methods 
for assaying biological activity and effect of the substitution. Such methods are suitable for 
proteins according to this invention that have cysteine residues suitable for such 

25 substitutions, for example to eliminate disulfide bond formation. 

Variants also include fragments of the polypeptides disclosed herein, particularly 
biologically active fragments and/or fragments corresponding to functional domains. 
Fragments of interest will typically be at least about 10 aa to at least about 15 aa in length, 
usually at least about 50 aa in length, and can be as long as 300 aa in length or longer, but 

30 will usually not exceed about 1000 aa in length, where the fragment will have a stretch of 
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amino acids that is identical to a polypeptide encoded by a polynucleotide having a 
sequence of any SEQ ID NOS: 1-844, or a homolog thereof. 

The protein variants described herein are encoded by polynucleotides that are within 
the scope of the invention. The genetic code can be used to select the appropriate codons to 
5 construct the corresponding variants. 

VI. Computer-Related Embodiments 

In general, a library of polynucleotides is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of polynucleotide 

10 molecules), or in electronic form (e.g., as a collection of polynucleotide sequences stored in a 
computer-readable form, as in a computer system and/or as part of a computer program). 
The sequence information of the polynucleotides can be used in a variety of ways, e.g., as a 
resource for gene discovery, as a representation of sequences expressed in a selected cell 
type (e.g., cell type markers), and/or as markers of a given disease or disease state. In 

15 general, a disease marker is a representation of a gene product that is present in all affected 
by disease either at an increased or decreased level relative to a normal cell (e.g., a cell of the 
same or similar type that is not substantially affected by disease). For example, a 
polynucleotide sequence in a library can be a polynucleotide that represents an mRNA, 
polypeptide, or other gene product encoded by the polynucleotide, that is either 

20 overexpressed or underexpressed in a breast ductal cell affected by cancer relative to a 
normal (i.e., substantially disease-free) breast cell. 

The nucleotide sequence information of the library can be embodied in any suitable 
form, e.g., electronic or biochemical forms. For example, a library of sequence information 
embodied in electronic form includes an accessible computer data file (or, in biochemical 

25 form, a collection of nucleic acid molecules) that contains the representative nucleotide 

sequences of genes that are differentially expressed (e.g., overexpressed or underexpressed) 
as between, for example, i) a cancerous cell and a normal cell; ii) a cancerous cell and a 
dysplastic cell; iii) a cancerous cell and a cell affected by a disease or condition other than 
cancer; iv) a metastatic cancerous cell and a normal cell and/or non-metastatic cancerous 

30 cell; v) a malignant cancerous cell and a non-malignant cancerous cell (or a normal cell) 

and/or vi) a dysplastic cell relative to a normal cell. Other combinations and comparisons of 
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cells affected by various diseases or stages of disease will be readily apparent to the 
ordinarily skilled artisan. Biochemical embodiments of the library include a collection of 
nucleic acids that have the sequences of the genes in the library, where the nucleic acids can 
correspond to the entire gene in the library or to a fragment thereof, as described in greater 
5 detail below. 

The polynucleotide libraries of the subject invention include sequence information of 
a plurality of polynucleotide sequences, where at least one of the polynucleotides has a 
sequence of any of SEQ ID NOS: 1 -844. By plurality is meant at least 2, usually at least 3 
and can include up to all of SEQ ID NOS: 1-844. The length and number of polynucleotides 

10 in the library will vary with the nature of the library, e.g., if the library is an oligonucleotide 
array, a cDNA array, a computer database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can 
be present in a variety of media. "Media" refers to a manufacture, other than an isolated 
nucleic acid molecule, that contains the sequence information of the present invention. Such 

1 5 a manufacture provides the genome sequence or a subset thereof in a form that can be 

examined by means not directly applicable to the sequence as it exists in a nucleic acid. For 
example, the nucleotide sequence of the present invention, e.g. the nucleic acid sequences of 
any of the polynucleotides of SEQ ID NOS: 1-844, can be recorded on computer readable 
media, e.g. any medium that can be read and accessed directly by a computer. Such media 

20 include, but are not limited to: magnetic storage media, such as a floppy disc, a hard disc 
storage medium, and a magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. One of skill in the art can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture 

25 comprising a recording of the present sequence information. ,, Recorded ,, refers to a process 
for storing information on computer readable medium, using any such methods as known in 
the art. Any convenient data storage structure can be chosen, based on the means used to 
access the stored information. A variety of data processor programs and formats can be used 
for storage, e.g. word processing text file, database format, etc. In addition to the sequence 

30 information, electronic versions of the libraries of the invention can be provided in 

conjunction or connection with other computer-readable information and/or other types of 
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computer-readable files (e.g., searchable files, executable files, etc, including, but not limited 
to, for example, search program software, etc.). 

By providing the nucleotide sequence in computer readable form, the information can 
be accessed for a variety of purposes. Computer software to access sequence information is 
5 publicly available. For example, the BLAST (Altschul et a!., supra.) and BLAZE (Brutlag 
et al. Comp. Chem. (1993) 17:203) search algorithms on a Sybase system can be used 
identify open reading frames (ORFs) within the genome that contain homology to ORFs 
from other organisms. 



10 means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based system are suitable for use in the present invention. The data 

1 5 storage means can comprise any manufacture comprising a recording of the present sequence 
information as described above, or a memory access means that can access such a 
manufacture. 

"Search means" refers to one or more programs implemented on the computer-based 
system, to compare a target sequence or target structural motif with the stored sequence 

20 information. Search means are used to identify fragments or regions of the genome that 
match a particular target sequence or target motif. A variety of known algorithms are 
publicly known and commercially available, e.g. MacPattern (EMBL), BLASTN and 
BLASTX (NCBI). A "target sequence" can be any DNA or amino acid sequence of six or 
more nucleotides or two or more amino acids, preferably from about 10 to 100 amino acids 

25 or from about 30 to 300 nucleotide residues. 

A "target structural motif," or "target motif," refers to any rationally selected 
sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration that is formed upon the folding of the target motif, or on 
consensus sequences of regulatory or active sites. There are a variety of target motifs known 

30 in the art. Protein target motifs include, but arc not limited to, enzyme active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, hairpin structures, 



As used herein, "a computer-based system" refers to the hardware means, software 
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promoter sequences and other expression elements such as binding sites for transcription 

factors. 

A variety of structural formats for the input and output means can be used to input 
and output the information in the computer-based systems of the present invention. One 
5 format for an output means ranks fragments of the genome possessing varying degrees of 
homology to a target sequence or target motif Such presentation provides a skilled artisan 
with a ranking of sequences and identifies the degree of sequence similarity contained in the 
identified fragment. 

A variety of comparing means can be used to compare a target sequence or target 
1 0 motif with the data storage means to identify sequence fragments of the genome. A skilled 
artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer based systems of the present 
invention. 

As discussed above, the "library" of the invention also encompasses biochemical 

15 libraries of the polynucleotides of SEQ ID NOS: 1-844, e.g., collections of nucleic acids 
representing the provided polynucleotides. The biochemical libraries can take a variety of 
forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably associated with a 
surface of a solid support (i.e., an array) and the like. Of particular interest are nucleic acid 
arrays in which one or more of SEQ ID NOS: 1-844 is represented on the array. By array is 

20 meant a an article of manufacture that has at least a substrate with at least two distinct 

nucleic acid targets on one of its surfaces, where the number of distinct nucleic acids can be 
considerably higher, typically being at least 10 nt, usually at least 20 nt and often at least 25 
nt. A variety of different array formats have been developed and are known to those of skill 
in the art, including those described in 5,242,974; 5,384,261 ; 5,405,783; 5,412,087; 

25 5,424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 
5,554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,711; 5,639,603; 5,658,734; WO 
93/17126; WO 95/1 1995; WO 95/35505; EP 742287; and EP 799897. The arrays of the 
subject invention find use in a variety of applications, including gene expression analysis, 
drug screening, mutation analysis and the like, as disclosed in the above-listed exemplary 

3 0 patent documents . 
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In addition to the above nucleic acid libraries, analogous libraries of polypeptides are 
also provided, where the where the polypeptides of the library will represent at least a 
portion of the polypeptides encoded by SEQ ID NOS: 1-844. 

VII. Utilities 

A. Use of Polynucleotide Probes in Mapping, and in Tissue Profiling 
Polynucleotide probes, generally comprising at least 12 contiguous nucleotides of a 
polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such as 
chromosome mapping of the polynucleotide and detection of transcription levels. Additional 
disclosure about preferred regions of the disclosed polynucleotide sequences is found in the 
Examples. A probe that hybridizes specifically to a polynucleotide disclosed herein should 
provide a detection signal at least 5-, 10-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 

Probes in Detection of Expression Levels. Nucleotide probes are used to detect 
expression of a gene corresponding to the provided polynucleotide. The references describe 
an example of a sandwich nucleotide hybridization assay. For example, in Northern blots, 
mRNA is separated electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRNA species of a particular size. The amount of hybridization is 
quantitated to determine relative amounts of expression, for example under a particular 
condition. Probes are also used to detect products of amplification by polymerase chain 
reaction. The products of the reaction are hybridized to the probe and hybrids are detected. 
Probes are used for in situ hybridization to cells to detect expression. Probes can also be 
used in vivo for diagnostic detection of hybridizing sequences. Probes are typically labeled 
with a radioactive isotope. Other types of detectable labels can be used such as 
chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are 
described in WO92/02526 and U.S. Patent No. 5,124,246. 

Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting 
small amounts of target nucleic acids (see, e.g., Mullis et ah, Meth. EnzymoL (1987) 
755:335; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer 
polynucleotides nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can be composed of sequence within or 3 f and 5' to the 
polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these 
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polynucleotides, they need not hybridize to them or the complements. A thermostable 
polymerase creates copies of target nucleic acids from the primers using the original target 
nucleic acids as a template. After a large amount of target nucleic acids is generated by the 
polymerase, it is detected by methods such as Southern blots. When using the Southern blot 
5 method, the labeled probe will hybridize to a polynucleotide of the Sequence Listing or 
complement. 

Furthermore, mRNA or cDNA can be detected by traditional blotting techniques 
described in Sambrook et al, "Molecular Cloning: A Laboratory Manual" (New York, Cold 
Spring Harbor Laboratory, 1989). mRNA or cDNA generated from mRNA using a 
10 polymerase enzyme can be purified and separated using gel electrophoresis. The nucleic 
acids on the gel are then blotted onto a solid support, such as nitrocellulose. The solid 
support is exposed to a labeled probe and then washed to remove any unhybridized probe. 
Next, the duplexes containing the labeled probe are detected. Typically, the probe is labeled 
with radioactivity. 

1 5 Ma pping. Polynucleotides of the present invention are used to identify a 

chromosome on which the corresponding gene resides. Such mapping can be useful in 
identifying the function of the polynucleotide-related gene by its proximity to other genes 
with known function. Function can also be assigned to the polynucleotide-related gene when 
particular syndromes or diseases map to the same chromosome. For example, use of 

20 polynucleotide probes in identification and quantification of nucleic acid sequence 
aberrations is described in U.S. Patent No. 5,783,387. 

For example, fluorescence in situ hybridization (FISH) on normal metaphase spreads 
facilitates comparative genomic hybridization to allow total genome assessment of changes 
in relative copy number of DNA sequences. See Schwartz and Samad, Curr. Opin. 

25 BiotechnoL (1994) 5:70; Kallioniemi et a/., Sem. Cancer Biol. (1993) 4:4\; Valdes et aL 9 
Methods in Molecular Biology (1997) 68:1, Boultwood, ed., Human Press, Totowa, NJ. 
Preparations of human metaphase chromosomes are prepared using standard cytogenetic 
techniques from human primary tissues or cell lines. Nucleotide probes comprising at least 
12 contiguous nucleotides selected from the nucleotide sequence shown in the Sequence 

30 Listing are used to identify the corresponding chromosome. The nucleotide probes are 

labeled, for example, with a radioactive, fluorescent, biotinylated, or chemiluminescent label, 
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and detected by well known methods appropriate for the particular label selected. Protocols 
for hybridizing nucleotide probes to preparations of metaphase chromosomes are also well 
known in the art. A nucleotide probe will hybridize specifically to nucleotide sequences in 
the chromosome preparations that are complementary to the nucleotide sequence of the 



Polynucleotides are mapped to particular chromosomes using, for example, radiation 
hybrids or chromosome-specific hybrid panels. See Leach et aL, Advances in Genetics, 
(1995) 53:63-99; Walter et aU Nature Genetics (1994) 7:22; Walter and Goodfellow, Trends 
in Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from Research 

10 Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are 
available via the world wide web at http:/F/shgc-www.stanford.edu; and http://\\^v- 
genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl . The statistical program RHMAP can be used 
to construct a map based on the data from radiation hybridization with a measure of the 
relative likelihood of one order versus another. RHMAP is available via the world wide web 

15 at http://www.sph.umich.edu/group/statgen/software. 

In addition, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Polynucleotides based on 
the polynucleotides of the invention can be used to probe these regions. For example, if 
through profile searching a provided polynucleotide is identified as corresponding to a gene 

20 encoding a kinase, its ability to bind to a cancer-related chromosomal region will suggest its 
role as a kinase in one or more stages of tumor cell development/growth. Although some 
experimentation would be required to elucidate the role, the polynucleotide constitutes a new 
material for isolating a specific protein that has potential for developing a cancer diagnostic 
or therapeutic. 

25 Tissue Typing or Profiling. Expression of specific mRNA corresponding to the 

provided polynucleotides can vary in different cell types and can be tissue-specific. This 
variation of mRNA levels in different cell types can be exploited with nucleic acid probe 
assays to determine tissue types. For example, PCR, branched DNA probe assays, or 
blotting techniques utilizing nucleic acid probes substantially identical or complementary to 

30 polynucleotides listed in the Sequence Listing can determine the presence or absence of the 
corresponding cDNA or mRNA. 
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For example, a metastatic lesion is identified by its developmental organ or tissue 

source by identifying the expression of a particular marker of that organ or tissue. If a 

polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to 

express that polynucleotide, then the developmental source of the lesion has been identified. 

5 Expression of a particular polynucleotide is assayed by detection of either the corresponding 

mRNA or the protein product. Immunological methods, such as antibody staining, are used 

to detect a particular protein product. Hybridization methods can be used to detect particular 

mRNA species, including but not limited to in situ hybridization and Northern blotting. 

Use of Polymorphisms. A polynucleotide of the invention will be useful in forensics, 

10 genetic analysis, mapping, and diagnostic applications if the corresponding region of a gene 
is polymorphic in the human population. Particular polymorphic forms of the provided 
polynucleotides can be used to either identify a sample as deriving from a suspect or rule out 
the possibility that the sample derives from the suspect. Any means for detecting a 
polymorphism in a gene are used, including but not limited to electrophoresis of protein 

1 5 polymorphic variants, differential sensitivity to restriction enzyme cleavage, and 
hybridization to allele-specific probes. 
B. Antibody Production 

Expression products of a polynucleotide of the invention, the corresponding mRNA 
or cDNA, or the corresponding complete gene are prepared and used for raising antibodies 

20 for experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a 

corresponding gene has not been assigned, this provides an additional method of identifying 
the corresponding gene. The polynucleotide or related cDNA is expressed as described 
above, and antibodies are prepared. These antibodies are specific to an epitope on the 
polypeptide encoded by the polynucleotide, and can precipitate or bind to the corresponding 

25 native protein in a cell or tissue preparation or in a cell-free extract of an in vitro expression 
system. 

Immunogens for raising antibodies are prepared by mixing the polypeptides encoded 
by the polynucleotides of the present invention with adjuvants. Alternatively, polypeptides 
are made as fusion proteins to larger immunogenic proteins. Polypeptides are also 
30 covalently linked to other larger immunogenic proteins, such as keyhole limpet hemocyanin. 
Immunogens are typically administered intradermally, subcutaneously, or intramuscularly. 
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Immunogens are administered to experimental animals such as rabbits, sheep, and mice, to 
generate antibodies. Optionally, the animal spleen cells are isolated and fused with myeloma 
cells to form hybridomas which secrete monoclonal antibodies. Such methods are well 
known in the art. According to another method known in the art, the selected polynucleotide 
5 is administered directly, such as by intramuscular injection, and expressed in vivo. The 
expressed protein generates a variety of protein-specific immune responses, including 
production of antibodies, comparable to administration of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for polypeptides 
encoded by a selected polynucleotide are made using standard methods known in the art. 

10 The antibodies specifically bind to epitopes present in the polypeptides encoded by 
polynucleotides disclosed in the Sequence Listing. Typically, at least 6, 8, 10, or 12 
contiguous amino acids are required to form an epitope. However, epitopes which involve 
non-contiguous amino acids may require more, for example at least 15, 25, or 50 amino 
acids. A short sequence of a polynucleotide may then be unsuitable for use as an epitope to 

1 5 raise antibodies for identifying the corresponding novel protein, because of the potential for 
cross-reactivity with a known protein. However, the antibodies can be useful for other 
purposes, particularly if they identify common structural features of a known protein and a 
novel polypeptide encoded by a polynucleotide of the invention. 



20 polypeptides should provide a detection signal at least 5-, 1 0-, or 20-fold higher than a 
detection signal provided with other proteins when used in Western blots or other 
immunochemical assays. Preferably, antibodies that specifically polypeptides of the 
invention do not bind to other proteins in immunochemical assays at detectable levels and 
can immunoprecipitate the specific polypeptide from solution. 

25 To test for the presence of serum antibodies to the polypeptide of the invention in a 

human population, human antibodies are purified by methods well known in the art. 
Preferably, the antibodies are affinity purified by passing antiserum over a column to which 
the corresponding selected polypeptide or fusion protein is bound. The bound antibodies can 
then be eluted from the column, for example using a buffer with a high salt concentration. 



Antibodies that specifically bind to human polypeptides encoded by the provided 
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In addition to the antibodies discussed above, genetically engineered antibody 

derivatives are made, such as single chain antibodies, according to methods well known in 
the art. 

C. Use of Polynucleotides to Construct Arrays for Diagnostics 

5 Polynucleotide arrays provide a high throughput technique that can assay a large 

number of polynucleotide sequences in a sample. This technology can be used as a 
diagnostic and as a tool to test for differential expression to determine function of an 
encoded protein. Arrays can be created by spotting polynucleotide probes onto a substrate 
(e.g., glass, nitrocelllose, etc.) in a two-dimensional matrix or array having bound probes. 

10 The probes can be bound to the substrate by either covalent bonds or by non-specific 

interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably 
labeled (e.g., using radioactive or fluorescent labels) and then hybridized to the probes. 
Double stranded polynucleotides, comprising the labeled sample polynucleotides bound to 
probe polynucleotides, can be detected once the unbound portion of the sample is washed 

15 away. Techniques for constructing arrays and methods of using these arrays are described in 
EP No. 0 799 897; PCTNo. WO 97/29212; PCTNo. WO 97/27317; EPNo. 0 785 280; PCT 
No. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP No. 0 728 520; 
U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCTNo. WO 
95/22058; and U.S. Pat. No. 5,631,734. 

20 As discussed in some detail above, arrays can be used to examine differential 

expression of genes and can be used to determine gene function. For example, arrays of the 
instant polynucleotide sequences can be used to determine if any of the provided 
polynucleotides are differentially expressed between a test cell and control cell (e.g., cancer 
cells and normal cells). For example, high expression of a particular message in a cancer 

25 cell, which is not observed in a corresponding normal cell, can indicate a cancer specific 

protein. Exemplary uses of arrays are further described in, for example, Pappalarado et al., 
Sem. Radiation Oncol. (1998) 5:217; and Ramsay Nature Biotechnol. (1998) 76:40. 

D. Differential Expression 

The polynucleotides of the invention can also be used to detect differences in 
30 expression levels between two cells, e.g. , as a method to identify abnormal or diseased tissue 
in a human. For polynucleotides corresponding to profiles of protein families as described 
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above, the choice of tissue can be selected according to the putative biological function. In 
general, the expression of a gene corresponding to a specific polynucleotide is compared 
between a first tissue that is suspected of being diseased and a second, normal tissue of the 
human. The tissue suspected of being abnormal or diseased can be derived from a different 
5 tissue type of the human, but preferably it is derived from the same tissue type; for example 
an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal 
tissue of the patient, especially those that express the polynucleotide-related gene of interest 
(e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 

10 pancreas, and the mucosal lining of the colon). A difference between the polynucleotide- 
related gene, mRNA, or protein in the two tissues which are compared, for example in 
molecular weight, amino acid or nucleotide sequence, or relative abundance, indicates a 
change in the gene, or a gene which regulates it, in the tissue of the human that was 
suspected of being diseased. Examples of detection of differential expression and its use in 

15 diagnosis of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125. 

The polynucleotide-related genes in the two tissues are compared by any means 
known in the art. For example, the two genes can be sequenced, and the sequence of the 
gene in the tissue suspected of being diseased compared with the gene sequence in the 
normal tissue. The genes corresponding to a provided polynucleotide, or portions thereof, in 

20 the two tissues are amplified, for example using nucleotide primers based on the nucleotide 
sequence shown in the Sequence Listing, using the polymerase chain reaction. The 
amplified genes or portions of genes are hybridized to detectably labeled nucleotide probes 
selected from a nucleotide sequence shown in the Sequence Listing. A difference in the 
nucleotide sequence of the isolated gene in the tissue suspected of being diseased compared 

25 with the normal nucleotide sequence suggests a role of the gene product encoded by the 
subject polynucleotide in the disease, and provides guidance for preparing a therapeutic 
agent. 

Alternatively, mRNA corresponding to a provided polynucleotide in the two tissues 
is compared. PolyA + RNA is isolated from the two tissues as is known in the art. For 
30 example, one of skill in the art can readily determine differences in the size or amount of 
mRNA transcripts between the two tissues using Northern blots and detectably labeled 
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nucleotide probes selected from the nucleotide sequence shown in the Sequence Listing. 
Increased or decreased expression of a given mRNA in a tissue sample suspected of being 
diseased, compared with the expression of the same mRNA in a normal tissue, suggests that 
the expressed protein has a role in the disease, and also provides a lead for preparing a 
5 therapeutic agent. 

The comparison can also be accomplished by analyzing polypeptides between the 
matched samples. The sizes of the proteins in the two tissues are compared, for example, 
using antibodies of the present invention to detect polypeptides in Western blots of protein 
extracts from the two tissues. Other changes, such as expression levels and subcellular 

10 localization, can also be detected immunologically, using antibodies to the corresponding 

protein. A higher or lower level of expression of a given polypeptide in a tissue suspected of 
being diseased, compared with the same protein expression level in a normal tissue, is 
indicative that the expressed protein has a role in the disease, and provides guidance for 
preparing a therapeutic agent. 

1 5 Similarly, comparison of polynucleotide sequences or of gene expression products, 

e.g., mRNA and protein, between a human tissue that is suspected of being diseased and a 
normal tissue of a human, are used to follow disease progression or remission in the human. 
Such comparisons are made as described above. For example, increased or decreased 
expression of a gene corresponding to an inventive polynucleotide in the tissue suspected of 

20 being neoplastic can indicate the presence of neoplastic cells in the tissue. The degree of 
increased expression of a given gene in the neoplastic tissue relative to expression of the 
same gene in normal tissue, or differences in the amount of increased expression of a given 
gene in the neoplastic tissue over time, is used to assess the progression of the neoplasia in 
that tissue or to monitor the response of the neoplastic tissue to a therapeutic protocol over 



The expression pattern of any two cell types can be compared, such as low and high 
metastatic tumor cell lines, malignant or non-malignant cells, or cells from tissue which have 
and have not been exposed to a therapeutic agent. A genetic predisposition to disease in a 
human is detected by comparing expression levels of an mRNA or protein corresponding to 
30 a polynucleotide of the invention in a fetal tissue with levels associated in normal fetal 

tissue. Fetal tissues that are used for this purpose include, but are not limited to, amniotic 



25 



time. 
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fluid, chorionic villi, blood, and the blastomere of an in vitro-fertilized embryo. The 
comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA or 
protein is obtained from a normal tissue of a human in which the polynucleotide-related gene 
is expressed. Differences such as alterations in the nucleotide sequence or size of the same 
5 product of the fetal polynucleotide-related gene or mRNA, or alterations in the molecular 
weight, amino acid sequence, or relative abundance of fetal protein, can indicate a germline 
mutation in the polynucleotide-related gene of the fetus, which indicates a genetic 
predisposition to disease. Particular diagnostic and prognostic uses of the disclosed 
polynucleotides are described in more detail below. 
10 E. Diagnostic, Prognostic, and Other Uses Based On Differential Expression 

In general, diagnostic methods of the invention for involve detection of a level or 
amount of a gene product, particularly a differentially expressed gene product, in a test 
sample obtained from a patient suspected of having or being susceptible to a disease (e.g., 
breast cancer, lung cancer, colon cancer and/or metastatic forms thereof), and comparing the 

15 detected levels to those levels found in normal cells (e.g., cells substantially unaffected by 
cancer) and/or other control cells (e.g., to differentiate a cancerous cell from a cell affected 
by dysplasia). Furthermore, the severity of the disease can be assessed by comparing the 
detected levels of a differentially expressed gene product with those levels detected in 
samples representing the levels of differentially gene product associated with varying 

20 degrees of severity of disease. 

The term "differentially expressed gene" is intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g., a 
polypeptide), and/or introns of such genes and adjacent 5* and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding 

25 region, but possibly further in either direction. The gene can be introduced into an 

appropriate vector for extrachromosomal maintenance or for integration into a host genome. 
In general, a difference in expression level associated with a decrease in expression level of 
at least about 25%, usually at least about 50% to 75%, more usually at least about 90% or 
more is indicative of a differentially expressed gene of interest, i.e., a gene that is 

30 underexpressed or down-regulated in the test sample relative to a control sample. 

Furthermore, a difference in expression level associated with an increase in expression of at 
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least about 25%, usually at least about 50% to 75%, more usually at least about 90% and can 

be at least about 1 14- fold, usually at least about 2-fold to about 10-fold, and can be about 

100-fold to about 1,000-fold increase relative to a control sample is indicative of a 

differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene. 

5 "Differentially expressed polynucleotide" as used herein means a nucleic acid 

molecule (RNA or DNA) having a sequence that represents a differentially expressed gene, 

e.g., the differentially expressed polynucleotide comprises a sequence (e.g., an open reading 

frame encoding a gene product) that uniquely identifies a differentially expressed gene so 

that detection of the differentially expressed polynucleotide in a sample is correlated with the 

10 presence of a differentially expressed gene in a sample. "Differentially expressed 

polynucleotides" is also meant to encompass fragments of the disclosed polynucleotides, 
e.g., fragments retaining biological activity, as well as nucleic acids homologous, 
substantially similar, or substantially identical (e.g., having about 90% sequence identity) to 
the disclosed polynucleotides. 

15 Methods of the subject invention useful in diagnosis or prognosis typically involve 

comparison of the abundance of a selected differentially expressed gene product in a sample 
of interest with that of a control to determine any relative differences in the expression of the 
gene product, where the difference can be measured qualitatively and/or quantitatively. 
Quantitation can be accomplished, for example, by comparing the level of expression 

20 product detected in the sample with the amounts of product present in a standard curve. A 
comparison can be made visually; by using a technique such as densitometry, with or 
without computerized assistance; by preparing a representative library of cDNA clones of 
mRNA isolated from a test sample, sequencing the clones in the library to determine that 
number of cDNA clones corresponding to the same gene product, and analyzing the number 

25 of clones corresponding to that same gene product relative to the number of clones of the 
same gene product in a control sample; or by using an array to detect relative levels of 
hybridization to a selected sequence or set of sequences, and comparing the hybridization 
pattern to that of a control. The differences in expression are then correlated with the 
presence or absence of an abnormal expression pattern. A variety of different methods for 

30 determining the nucleic acid abundance in a sample are known to those of skill in the art, 
where particular methods of interest include those described in: Pietu et al. Genome Res. 
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(1996) 6:492; Zhao et al t Gene (1995) 1 56:201 \ Soares , Curr. Opin. Biotechnol (1977) 8: 

542; Raval, J. Pharmacol Toxicol Methods (1994) J2:125; Chalifour et al, Anal. Biochem 
(1994) 27(5:299; Stolz e/ a/., Mo/. Biotechnol. (1996) <5:225; Hong e/ al, Biosci. Reports 
(1982) 2:907; and McGraw ? ^«a/. Biochem. (1984) 745:298. Also of interest are the 
5 methods disclosed in WO 97/273 1 7, the disclosure of which is herein incorporated by 
reference. 

In general, diagnostic assays of the invention involve detection of a gene product of a 
the polynucleotide sequence {e.g., mRNA or polypeptide) that corresponds to a sequence of 
SEQ ID NOS: 1-844. The patient from whom the sample is obtained can be apparently 

10 healthy, susceptible to disease (e.g., as determined by family history or exposure to certain 
environmental factors), or can already be identified as having a condition in which altered 
expression of a gene product of the invention is implicated. 

In the assays of the invention, the diagnosis can be determined based on detected 
gene product expression levels of a gene product encoded by at least one, preferably at least 

15 two or more, at least 3 or more, or at least 4 or more of the polynucleotides having a 

sequence set forth in SEQ ID NOS: 1-844, and can involve detection of expression of genes 
corresponding to all of SEQ ID NOS: 1-844 and/or additional sequences that can serve as 
additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably 

20 involves detection of a gene product encoded by a gene corresponding to a polynucleotide 
that is differentially expressed in cancer. For example, a higher level of expression of a 
polynucleotide corresponding to SEQ ID NO:52 relative to a level associated with a normal 
sample can indicate the presence of cancer in the patient from whom the sample is derived. 
In another example, detection of a lower level of a polynucleotide corresponding to SEQ ID 

25 NO:39 relative to a normal level is indicative of the presence of cancer in the patient. 
Further examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their 
relative expression levels provided herein, assays using such polynucleotides and detection 
of their expression levels in diagnosis and prognosis will be readily apparent to the ordinarily 

30 skilled artisan. 
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Any of a variety of detectable labels can be used in connection with the various 
embodiments of the diagnostic methods of the invention. Suitable detectable labels include 
fluorochromes,(e .g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2\7 , -dimethoxy-4\5 , - 
5 dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy- 

2%4\7\4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'- 
tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels, {e.g. 32 P, 3 'S, 3 H, etc.), and 
the like. The detectable label can involve a two stage systems {e.g., biotin-avidin, hapten- 
anti-hapten antibody, etc.) 

10 Reagents specific for the polynucleotides and polypeptides of the invention, such as 

antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an 
expression product in a biological sample. The kit can also contain buffers or labeling 
components, as well as instructions for using the reagents to detect and quantify expression 
products in the biological sample. Exemplary embodiments of the diagnostic methods of the 

1 5 invention are described below in more detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample is assayed 
for the level of a differentially expressed polypeptide. Diagnosis can be accomplished using 
any of a number of methods to determine the absence or presence or altered amounts of the 
differentially expressed polypeptide in the test sample. For example, detection can utilize 

20 staining of cells or histological sections with labeled antibodies, performed in accordance 
with conventional methods. Cells can be permeabilized to stain cytoplasmic molecules. In 
general, antibodies that specifically bind a differentially expressed polypeptide of the 
invention are added to a sample, and incubated for a period of time sufficient to allow 
binding to the epitope, usually at least about 10 minutes. The antibody can be detectably 

25 labeled for direct detection {e.g., using radioisotopes, enzymes, fluorescers, 

chemiluminescers, and the like), or can be used in conjunction with a second stage antibody 
or reagent to detect binding {e.g., biotin with horseradish peroxidase -conjugated avidin, a 
secondary antibody conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, 
Texas red, etc.). The absence or presence of antibody binding can be determined by various 

30 methods, including flow cytometry of dissociated cells, microscopy, radiography, 

scintillation counting, etc. Any suitable alternative methods can of qualitative or quantitative 
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detection of levels or amounts of differentially expressed polypeptide can be used, for 
example ELISA, western blot, immunoprecipitation, radioimmunoassay, etc. 

In general, the detected level of differentially expressed polypeptide in the test 
sample is compared to a level of the differentially expressed gene product in a reference or 
5 control sample, e.g., in a normal cell (negative control) or in a cell having a known disease 
state (positive control). For example, a higher level of expression of a polypeptide encoded 
by SEQ ID NO:52 relative to a level associated with a normal sample can indicate the 
presence of cancer in the patient from whom the sample is derived. In another example, 
detection of a lower level of the polypeptide encoded by SEQ ID NO:39 relative to a normal 

1 0 level is indicative of the presence of cancer in the patient. 

mRNA detection. The diagnostic methods of the invention can also or alternatively 
involve detection of mRNA encoded by a gene corresponding to a differentially expressed 
polynucleotides of the invention. Any suitable qualitative or quantitative methods known in 
the art for detecting specific mRNAs can be used. mRNA can be detected by, for example, 

15 in situ hybridization in tissue sections, by reverse transcriptase-PCR, or in Northern blots 
containing poly A+ mRNA. One of skill in the art can readily use these methods to 
determine differences in the size or amount of mRNA transcripts between two samples. For 
example, the level of mRNA of the invention in a tissue sample suspected of being 
cancerous or dysplastic is compared with the expression of the mRNA in a reference sample, 

20 e.g., a positive or negative control sample (e.g., normal tissue, cancerous tissue, etc.). In a 
specific non-limiting example, a higher level of mRNA corresponding to SEQ ID NO:52 
relative to a level associated with a normal sample can indicate the presence of cancer in the 
patient from whom the sample is derived. In another example, detection of a lower level of 
mRNA corresponding to SEQ ID NO:39 relative to a normal level is indicative of the 

25 presence of cancer in the patient. 

Any suitable method for detecting and comparing mRNA expression levels in a 
sample can be used in connection with the diagnostic methods of the invention (see, e.g., 
U.S. 5,804,382). For example, mRNA expression levels in a sample can be determined by 
generation of a library of expressed sequence tags (ESTs) from the sample, where the EST 

30 library is representative of cequences present in the sample (Adams, et al., (1991) Science 

252:1651). Enumeration of the relative representation of ESTs within the library can be used 
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to approximate the relative representation of the gene transcript within the starting sample. 
The results of EST analysis of a test sample can then be compared to EST analysis of a 
reference sample to determine the relative expression levels of a selected polynucleotide, 
particularly a polynucleotide corresponding to one or more of the differentially expressed 
5 genes described herein. 

Alternatively, gene expression in a test sample can be performed using serial analysis 
of gene expression (SAGE) methodology (Velculescu et al., Science (1995) 270:484). In 
short, SAGE involves the isolation of short unique sequence tags from a specific location 
within each transcript {e.g., a sequence of any one of SEQ ID NOS: 1-6). The sequence tags 

1 0 are concatenated, cloned, and sequenced. The frequency of particular transcripts within the 
starting sample is reflected by the number of times the associated sequence tag is 
encountered with the sequence population. 

Gene expression in a test sample can also be analyzed using differential display (DD) 
methodology. In DD, fragments defined by specific sequence delimiters {e.g., restriction 

15 enzyme sites) are used as unique identifiers of genes, coupled with information about 

fragment length or fragment location within the expressed gene. The relative representation 
of an expressed gene with a sample can then be estimated based on the relative 
representation of the fragment associated with that gene within the pool of all possible 
fragments. Methods and compositions for carrying out DD are well known in the art, see, 

20 e.g., U.S. 5,776,683; and U.S. 5,807,680. 

Alternatively, gene expression in a sample using hybridization analysis, which is 
based on the specificity of nucleotide interactions. Oligonucleotides or cDNA can be used to 
selectively identify or capture DNA or RNA of specific sequence composition, and the 
amount of RNA or cDNA hybridized to a known capture sequence determined qualitatively 

25 or quantitatively, to provide information about the relative representation of a particular 
message within the pool of cellular messages in a sample. Hybridization analysis can be 
designed to allow for concurrent screening of the relative expression of hundreds to 
thousands of genes by using, for example, array-based technologies having high density 
formats, including filters, microscope slides, or microchips, or solution-based technologies 

30 that use spectroscopic analysis {e.g., mass spectrometry). One exemplary use of arrays in the 
diagnostic methods of the invention is described below in more detail. 
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Use of a single gene in diagnostic applications. The diagnostic methods of the 

invention can focus on the expression of a single differentially expressed gene. For example, 
the diagnostic method can involve detecting a differentially expressed gene, or a 
polymorphism of such a gene (e.g., a polymorphism in an coding region or control region), 
5 that is associated with disease. Disease-associated polymorphisms can include deletion or 
truncation of the gene, mutations that alter expression level and/or affect activity of the 
encoded protein, etc. 

Changes in the promoter or enhancer sequence that affect expression levels of an 
differentially gene can be compared to expression levels of the normal allele by various 

10 methods known in the art. Methods for determining promoter or enhancer strength include 
quantitation of the expressed natural protein; insertion of the variant control element into a 
vector with a reporter gene such as p-galactosidase, luciferase, chloramphenicol 
acetyltransferase, etc. that provides for convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 

15 specific sequence, e.g. a disease associated polymorphism. Where large amounts of DNA 
are available, genomic DNA is used directly. Alternatively, the region of interest is cloned 
into a suitable vector and grown in sufficient quantity for analysis. Cells that express a 
differentially expressed gene can be used as a source of mRNA, which can be assayed 
directly or reverse transcribed into cDNA for analysis. The nucleic acid can be amplified by 

20 conventional techniques, such as the polymerase chain reaction (PCR), to provide sufficient 
amounts for analysis, and a detectable label can be included in the amplification reaction 
(e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to facilitate 
detection. The use of the polymerase chain reaction is described in Saiki, et aL, Science 
(1985) 259:487, and a review of techniques can be found in Sambrook, et al., Molecular 

25 Cloning: A Laboratory Manual, (1 989) pp. 14.2. Alternatively, various methods are known 
in the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley et al, Nucl. Acids Res. (1990) 75:2887; and Delahunty et al., Am. J. 
Hum. Genet. (1996) 55:1239. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 

30 number of methods known in the art. The nucleic acid can be sequenced by dideoxy or other 
methods, and the sequence of bases compared to a selected sequence, e.g., to a wild-type 

56 



BNSDOCtD: <WO 9933982A2_I_> 



WO 99/33982 PCT/US98/27610 
sequence. Hybridization with the polymorphic or variant sequence can also be used to 

determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The hybridization 

pattern of a polymorphic or variant sequence and a control sequence to an array of 

oligonucleotide probes immobilized on a solid support, as described in US 5,445,934, or in 

5 WO 95/35505, can also be used as a means of identifying polymorphic or variant sequences 

associated with disease. Single strand conformational polymorphism (SSCP) analysis, 

denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices 

are used to detect conformational changes created by DNA sequence variation as alterations 

in electrophoretic mobility. Alternatively, where a polymorphism creates or destroys a 

10 recognition site for a restriction endonuclease, the sample is digested with that endonuclease, 

and the products size fractionated to determine whether the fragment was digested. 

Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 

agarose gels. 

Screening for mutations in an differentially expressed gene can be based on the 

1 5 functional or antigenic characteristics of the protein. Protein truncation assays are useful in 
detecting deletions that can affect the biological activity of the protein. Various 
immunoassays designed to detect polymorphisms in proteins can be used in screening. 
Where many diverse genetic mutations lead to a particular disease phenotype, functional 
protein assays have proven to be effective screening tools. The activity of the encoded 

20 protein can be determined by comparison with the wild-type protein. 

Pattern matching in diagnosis using arrays. In another embodiment, the diagnostic 
and/or prognostic methods of the invention involve detection of expression of a selected set 
of genes in a test sample to produce a test expression pattern (TEP). The TEP is compared to 
a reference expression pattern (REP), which is generated by detection of expression of the 

25 selected set of genes in a reference sample (e.g., a positive or negative control sample). The 
selected set of genes includes at least one of the genes of the invention, which genes 
correspond to the polynucleotide sequences of SEQ ID NOS: 1-844. Of particular interest is 
a selected set of genes that includes gene differentially expressed in the disease for which the 
test sample is to be screened. 

30 "Reference sequences" or "reference polynucleotides" as used herein in the context of 

differential gene expression analysis and diagnosis/prognosis refers to a selected set of 
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polynucleotides, which selected set includes at least one or more of the differentially 
expressed polynucleotides described herein. A plurality of reference sequences, preferably 
comprising positive and negative control sequences, can be included as reference sequences. 
Additional suitable reference sequences are found in Genbank, Unigene, and other 
5 nucleotide sequence databases (including, e.g., expressed sequence tag (EST), partial, and 
full-length sequences). 

"Reference array" means an array having reference sequences for use in hybridization 
with a sample, where the reference sequences include all, at least one of, or any subset of the 
differentially expressed polynucleotides described herein. Usually such an array will include 

10 at least 3 different reference sequences, and can include any one or all of the provided 
differentially expressed sequences. Arrays of interest can further comprise sequences, 
including polymorphisms, of other genetic sequences, particularly other sequences of interest 
for screening for a disease or disorder (e.g., cancer, dysplasia, or other related or unrelated 
diseases, disorders, or conditions). The oligonucleotide sequence on the array will usually 

15 be at least about 12 nt in length, and can be of about the length of the provided sequences, or 
can extend into the flanking regions to generate fragments of 100 nt to 200 nt in length or 
more. 

A "reference expression pattern" or "REP" as used herein refers to the relative levels 
of expression of a selected set of genes, particularly of differentially expressed genes, that is 
20 associated with a selected cell type, e.g., a normal cell, a cancerous cell, a cell exposed to an 
environmental stimulus, and the like. A "test expression pattern" or "TEP" refers to relative 
levels of expression of a selected set of genes, particularly of differentially expressed genes, 
in a test sample (e.g., a cell of unknown or suspected disease state, from which mRNA is 



susceptibility to a disease or disorder, determination as to whether a subject is presently 
affected by a disease or disorder, as well as to the prognosis of a subject affected by a disease 
or disorder (e.g., identification of pre-metastatic or metastatic cancerous states, stages of 
cancer, or responsiveness of cancer to therapy). The present invention particularly 
30 encompasses diagnosis of subjects in the context of breast cancer (e.g. , carcinoma in situ 
(e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive breast cancer, ER-negative 



isolated). 



25 



'Diagnosis" as used herein generally includes determination of a subject's 
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breast cancer, or other forms and/or stages of breast cancer), lung cancer (e.g., small cell 

carcinoma, non-small cell carcinoma, mesothelioma, and other forms and/or stages of lung 
cancer), and colon cancer (e.g., adenomatous polyp, colorectal carcinoma, and other forms 
and/or stages of colon cancer). 
5 "Sample" or "biological sample" as used throughout here are generally meant to refer 

to samples of biological fluids or tissues, particularly samples obtained from tissues, 
especially from cells of the type associated with the disease for which the diagnostic 
application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is also meant 
to encompass derivatives and fractions of such samples (e.g., cell lysates). Where the sample 

10 is solid tissue, the cells of the tissue can be dissociated or tissue sections can be analyzed. 

REPs can be generated in a variety of ways according to methods well known in the 
art. For example, REPs can be generated by hybridizing a control sample to an array having 
a selected set of polynucleotides (particularly a selected set of differentially expressed 
polynucleotides), acquiring the hybridization data from the array, and storing the data in a 

15 format that allows for ready comparison of the REP with a TEP. Alternatively, all expressed 
sequences in a control sample can be isolated and sequenced, e.g., by isolating mRNA from 
a control sample, converting the mRNA into cDNA, and sequencing the cDNA. The 
resulting sequence information roughly or precisely reflects the identity and relative number 
of expressed sequences in the sample. The sequence information can then be stored in a 

20 format (e.g., a computer-readable format) that allows for ready comparison of the REP with 
a TEP. The REP can be normalized prior to or after data storage, and/or can be processed to 
selectively remove sequences of expressed genes that are of less interest or that might 
complicate analysis (e.g., some or all of the sequences associated with housekeeping genes 
can be eliminated from REP data). 

25 TEPs can be generated in a manner similar to REPs, e.g. , by hybridizing a test sample 

to an array having a selected set of polynucleotides, particularly a selected set of 
differentially expressed polynucleotides, acquiring the hybridization data from the array, and 
storing the data in a format that allows for ready comparison of the TEP with a REP. The 
REP and TEP to be used in a comparison can be generated simultaneously, or the TEP can 

30 be compared to previously generated and stored REPs. 
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In one embodiment of the invention, comparison of a TEP with a REP involves 
hybridizing a test sample with a reference array, where the reference array has one or more 
reference sequences for use in hybridization with a sample. The reference sequences include 
all, at least one of, or any subset of the differentially expressed polynucleotides described 
5 herein. Hybridization data for the test sample is acquired, the data normalized, and the 
produced TEP compared with a REP generated using an array having the same or similar 
selected set of differentially expressed polynucleotides. Probes that correspond to sequences 
differentially expressed between the two samples will show decreased or increased 
hybridization efficiency for one of the samples relative to the other. 

10 Reference arrays can be produced according to any suitable methods known in the 

art. For example, methods of producing large arrays of oligonucleotides are described in 
U.S. 5,134,854, and U.S. 5,445,934 using light-directed synthesis techniques. Using a 
computer controlled system, a heterogeneous array of monomers is converted, through 
simultaneous coupling at a number of reaction sites, into a heterogeneous array of polymers. 

15 Alternatively, microarrays are generated by deposition of pre-synthesized oligonucleotides 
onto a solid substrate, for example as described in PCT published application no. 
WO 95/35505. 

Methods for collection of data from hybridization of samples with a reference arrays 
are also well known in the art. For example, the polynucleotides of the reference and test 

20 samples can be generated using a detectable fluorescent label, and hybridization of the 

polynucleotides in the samples detected by scanning the microarrays for the presence of the 
detectable label. Methods and devices for detecting fluorescently marked targets on devices 
are known in the art. Generally, such detection devices include a microscope and light 
source for directing light at a substrate. A photon counter detects fluorescence from the 

25 substrate, while an x-y translation stage varies the location of the substrate. A confocal 
detection device that can be used in the subject methods is described in U.S. Patent no. 
5,631,734. A scanning laser microscope is described in Shalon et al., Genome Res. (1996) 
6:639. A scan, using the appropriate excitation line, is performed for each fluorophore used. 
The digital images generated from the scan are then combined for subsequent analysis. For 

30 any particular array element, the ratio of the fluorescent signal from one sample (e.g., a test 
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sample) is compared to the fluorescent signal from another sample (e.g., a reference sample), 

and the relative signal intensity determined. 

Methods for analyzing the data collected from hybridization to arrays are well known 

in the art. For example, where detection of hybridization involves a fluorescent label, data 

5 analysis can include the steps of determining fluorescent intensity as a function of substrate 

position from the data collected, removing outliers, i.e. data deviating from a predetermined 

statistical distribution, and calculating the relative binding affinity of the targets from the 

remaining data. The resulting data can be displayed as an image with the intensity in each 

region varying according to the binding affinity between targets and probes. 

10 In general, the test sample is classified as having a gene expression profile 

corresponding to that associated with a disease or non-disease state by comparing the TEP 
generated from the test sample to one or more REPs generated from reference samples (e.g., 
from samples associated with cancer or specific stages of cancer, dysplasia, samples affected 
by a disease other than cancer, normal samples, etc.). The criteria for a match or a 

15 substantial match between a TEP and a REP include expression of the same or substantially 
the same set of reference genes, as well as expression of these reference genes at 
substantially the same levels {e.g., no significant difference between the samples for a signal 
associated with a selected reference sequence after normalization of the samples, or at least 
no greater than about 25% to about 40% difference in signal strength for a given reference 

20 sequence. In general, a pattern match between a TEP and a REP includes a match in 

expression, preferably a match in qualitative or quantitative expression level, of at least one 
of, all or any subset of the differentially expressed genes of the invention. 

Pattern matching can be performed manually, or can be performed using a computer 
program. Methods for preparation of substrate matrices (e.g., arrays), design of 

25 oligonucleotides for use with such matrices, labeling of probes, hybridization conditions, 
scanning of hybridized matrices, and analysis of patterns generated, including comparison 
analysis, are described in, for example, U.S. 5,800,992. 

F. Use of the Polynucleotides of the Invention in Cancer 
Oncogenesis involves the unbridled growth, dedifferentiation and abnormal 

30 migration of cells. Cancerous cells can have the ability to compress, invade, and destroy 
normal tissue. Cancerous cells may also metastasize to other parts of the body via the 
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bloodstream or the lymph system and colonize in these other areas. Different cancers are 
classified by the cell from which the cancerous cell is derived and from its cellular 
morphology and/or state of differentiation. 

Somatic genetic abnormalities cause cancer initiation and progression. Cancer 
generally is clonally formed, i.e. gain of function of oncogenes and loss of function of tumor 
suppressor genes within a single cell transform the cell to be cancerous, and that single cell 
grows and divides to form a cancerous lesion. The genes known to be involved in cancer 
initiation and progression are involved in numerous cellular functions, including 
developmental differentiation, cell cycle regulation, cell signaling, immunological response, 
DNA replication, and DNA repair. 

The identification and characterization of genetic or biochemical markers in blood or 
tissues that will detect the earliest changes along the carcinogenesis pathway and monitor the 
efficacy of various therapies and preventive interventions is a major goal of cancer research. 
Scientists have identified genetic changes in stool specimens that indicate the stages of colon 
cancer, and other biomarkers such as gene mutations, hormone receptors, proteins that 
inhibit metastasis, and enzymes that metabolize drugs are all being used to determine the 
severity and predict the course of breast, prostate, lung, and other cancers. 

Recent advances in the pathogenesis of certain cancers has been helpful in 
determining patient treatment. The level of expression of certain polynucleotides can be 
indicative of a poorer prognosis, and therefore warrant more aggressive chemo- or radio- 
therapy for a patient. The correlation of novel surrogate tumor specific features with 
response to treatment and outcome in patients has defined certain prognostic indicators that 
allow the design of tailored therapy based on the molecular profile of the tumor. These 
therapies include antibody targeting and gene therapy. Moreover, a promising level of one 
or more marker polynucleotides can provide impetus for not aggressively treating a 
particular patient, thus sparing the patient the deleterious side effects of aggressive therapy. 
Determining expression of certain polynucleotides and comparison of a patients profile with 
known expression in normal tissue and variants of the disease allows a determination of the 
best possible treatment for a patient, both in terms of specificity of treatment and in terms of 
comfort level of the patient. 
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Surrogate tumor markers, such as polynucleotide expression, can also be used to 

better classify, and thus diagnose and treat, different forms and disease states of cancer. 

Two classifications widely used in oncology that can benefit from identification of the 

expression levels of the polynucleotides of the invention are staging of the cancerous 

5 disorder, and grading the nature of the cancerous tissue. 

Staging. Staging is a process used by physicians to describe how advanced the 

cancerous state is in a patient. Staging assists the physician in determining a prognosis, 

planning treatment and evaluating the results of such treatment. Different staging systems 

are used for different types of cancer, but each generally involves the following 

10 determinations: the type of tumor, indicated by T; whether the cancer has metastasized to 
nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more 
distant parts of the body, indicated by M. This system of staging is called the TNM system. 
Generally, if a cancer is only detectable in the area of the primary lesion without having 
spread to any lymph nodes it is called Stage I. If it has spread only to the closest lymph 

15 nodes, it is called Stage II. In Stage III, the cancer has generally spread to the lymph nodes 
in near proximity to the site of the primary lesion. Cancers that have spread to a distant part 
of the body, such as the liver, bone, brain or another site, are called Stage IV, the most 
advanced stage. 

Currently, the determination of staging is done using pathological techniques and is 
20 based more on the presence or absence of malignant tissue rather than the characteristics of 
the tumor type. Presence or absence of malignant tissue is based primarily on the gross 
morphology of the cells in the areas biopsied. The polynucleotides of the invention can 
facilitate fine-tuning of the staging process by identifying markers for the aggresivity of a 
cancer, e.g. the metastatic potential, as well as the presence in different areas of the body. 
25 Thus, a Stage II cancer with a polynucleotide signifying a high metastatic potential cancer 
can be used to change a borderline Stage II tumor to a Stage III tumor, justifying more 
aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a tumor resembles 
30 normal tissue of its same type. Based on the microscopic appearance of a tumor, 

pathologists will identify the grade of a tumor based on parameters such as cell morphology, 
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cellular organization, and other markers of differentiation. As a general rule, the grade of a 

tumor corresponds to its rate of growth or aggressiveness. < That is, undifferentiated or high- 
grade tumors grow more quickly than well differentiated or low-grade tumors. Information 
about tumor grade is usefiil in planning treatment and predicting prognosis. 
5 The American Joint Commission on Cancer has recommended the following 

guidelines for grading tumors: 1) GX Grade cannot be assessed; 2) Gl Well differentiated; 
G2 Moderately well differentiated; 3) G3 Poorly differentiated; 4) G4 Undifferentiated. 
Although grading is used by pathologists to describe most cancers, it plays a more important 
role in treatment planning for certain types than for others. An example is the Gleason 
1 0 system that is specific for prostate cancer, which uses grade numbers to describe the degree 
of differentiation. Lower Gleason scores indicate well-differentiated cells. Intermediate 
scores denote tumors with moderately differentiated cells. Higher scores describe poorly 
differentiated cells. Grade is also important in some types of brain tumors and soft tissue 
sarcomas. 

1 5 The polynucleotides of the invention can be especially valuable in determining the 

grade of the tumor, as they not only can aid in determining the differentiation status of the 
cells of a tumor, they can also identify factors other than differentiation that are valuable in 
determining the aggressivity of a tumor, such as metastatic potential. 

Familial Cancer Genes. A number of cancer syndromes are linked to Mendelian 

20 inheritance of a predisposition to develop particular cancers. The following table contains a 
list of cancer types that can be inherited, and for which the gene or genes responsible have 
been identified. Most of the cancer types listed can occur as part of several different genetic 
conditions, each caused by alterations in a different gene. 



Cancer Type 


Genetic Condition 


Gene 


Brain 




Li-Fraumeni syndrome 
Neurofibromatosis 1 
Neurofibromatosis 2 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


TP53 

NFl 

NF2 

VHL 

TSC2 


Breast 




Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 
Li-Fraumeni syndrome 
Ataxia telangiectasia 


BRCAl 
BRCA2 
TP53 
ATM 


Colon 


Familial adenomatous polyposis (FAP) 
Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 


APC 

HMSH2 

hMLHl 
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Cancer Type 


Genetic Condition 

Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


Gene 

hPMSl 
hPMS2 


Endocrine 

(parathyroid, pituitary, GI endocrine) 


Multiple endocrine neoplasia 1 (MEN!) 


K/fFKI 1 

ivihin i 


Endocrine 

(pheochromacytoma, medullary thyroid) 


Multiple endocrine neoplasia 2 (MEN2) 


RET 


Endometrial 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


Ki\yTQI-I9 

nivioiiZ 
hMLHl 
hPMSl 
nr ivio-i 


Eye 


Hereditary retinoblastoma 


RBI 


Hematologic 

(lymphomas and leukemia) 


Li-Fraumeni syndrome 
Ataxia telangiectasia 


1 rjj 

ATM 


Kidney 


Hereditary Wilms 1 tumor 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


W I 1 
VHL 
TSC2 


Ovary 


Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 


BRLA1 
BRCA2 


Sarcoma 


Hereditary retinoblastoma 
Li-Fraumeni syndrome 
Neurofibromatosis 1 


RBI 

TP53 

NF1 


Skin 


Hereditary melanoma 1 
Hereditary melanoma 2 
Basal cell naevus (Gorlin) syndrome 


CDKN2 

CDK4 

PTCH 


Stomach 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


hMSH2 
hMLHl 
hPMSl 
hPMS2 



The polynucleotides of the invention can be especially useful to monitor patients having any 
of the above syndromes to detect potentially malignant events at a molecular level before 
they are detectable at a gross morphological level. As can be seen from the table, a number 
5 of genes are involved in multiple forms of cancer. Thus, a polynucleotide of the invention 
identified as important for metastatic colon cancer can also have clinical implications for a 
patient diagnosed with stomach cancer or endometrial cancer. 

Lung Cancer. Lung cancer is one of the most common cancers in the United States, 
accounting for about 15 percent of all cancer cases, or 170,000 new cases each year. At this 
10 time, over half of the lung cancer cases in the United States are in men, but the number found 
in women is increasing and will soon equal that in men. Today more women die of lung 
cancer than of breast cancer. Lung cancer is especially difficult to diagnose and treat 
because of the large size of the lungs, which allows cancer to develop for years undetected. 
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In fact, lung cancer can spread outside the lungs without causing any symptoms. Adding to 
the confusion, the most common symptom of lung cancer, a persistent cough, can often be 
mistaken for a cold or bronchitis. 

Although there are more than a dozen different kinds of lung cancer, the two main 
5 types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all 

lung cancer cases. Small cell carcinoma (also called oat cell carcinoma), which usually starts 
in one of the larger bronchial tubes, grows fairly rapidly, and is likely to be large by the time 
of diagnosis. Nonsmall cell lung cancer (NSCLC) is made up of three general subtypes of 
lung cancer. Epidermoid carcinoma (also called squamous cell carcinoma) usually starts in 

10 one of the larger bronchial tubes and grows relatively slowly. The size of these tumors can 
range from very small to quite large. Adenocarcinoma starts growing near the outside 
surface of the lung and can vary in both size and growth rate. Some slowly growing 
adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the 
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed. 

15 Other less common forms of ljing cancer are carcinoid, cylindroma, mucoepidermoid, and 
malignant mesothelioma. 

Currently, CT scans, MRIs, X-rays, sputum cytology, and biopsies are used to 
diagnose nonsmall cell lung cancer. The form and cellular origin of the lung cancer is 
diagnosed primarily through biopsy from either a surgical biopsy or a needle aspiration of 

20 lung tissue, and usually the biopsy is prompted from an abnormality identified on an X-ray. 
In some cases, sputum cytology can reveal lung cancers in patients with normal X-rays or 
can determine the type of lung cancer, but because it cannot pinpoint the tumor's location, a 
positive sputum cytology test is usually followed by further tests. Since these tests are based 
in large part on gross morphology of the tissue, the diagnosis of a particular kind of tumor is 

25 largely subjective, and the diagnosis can vary significantly between clinicians. 

The polynucleotides of the invention can be used to distinguish types of lung cancer 
as well as identifying traits specific to a certain patient's cancer. For example, if the patient's 
biopsy expresses a polynucleotide that is associated with a low metastatic potential, it may 
justify leaving a larger portion of the patient's lung in surgery to remove the lesion. 

30 Alternatively, a smaller lesion with expression of a polynucleotide that is associated with 
high metastatic potential may justify a more radical removal of lung tissue and/or the 



66 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 PCT/US98/2761 0 

surrounding lymph nodes, even if no metastasis can be identified through pathological 

examination. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
5 polynucleotide in hyperplasia can be used as a diagnostic marker for metastatic lung cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
those that exhibit differential expression between high metastatic versus low metastatic lung 
cancer , i.e. SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 381, 395, and 400. Detection of malignant lung cancer with a higher 
10 metastatic potential can be determined using expression levels of any of these sequences 
alone or in combination with the levels of expression of other known genes. 

Breast Cancer. The National Cancer Institute (NCI) estimates that about 1 in 8 
women in the United States will develop breast cancer during her lifetime. Clinical breast 
examination and mammography are recommended as combined modalities for breast cancer 
1 5 screening, and the nature of the cancer will often depend upon the location of the tumor and 
the cell type from which the tumor is derived. The majority of breast cancers are 
adenocarcinomas subtypes, which can be summarized as follows: 

Ductal carcinoma in situ (DCIS): Ductal carcinoma in situ is the most common type 
of noninvasive breast cancer. In DCIS, the malignant cells have not metastasized through 
20 the walls of the ducts into the fatty tissue of the breast. Comedocarcinoma is a type of DCIS 
that is more likely than other types of DCIS to come back in the same area after 
lumpectomy. It is more closely linked to eventual development of invasive ductal carcinoma 
than other forms of DCIS. 

Infiltrating (or invasive) ductal carcinoma (IDC): this type of cancer has metastasized 
25 through the wall of the duct and invaded the fatty tissue of the breast. At this point, it has the 
potential to use the lymphatic system and bloodstream for metastasis to more distant parts of 
the body. Infiltrating ductal carcinoma accounts for about 80% of breast cancers. 

Lobular carcinoma in situ (LCIS): While not a true cancer, LCIS (also called lobular 
neoplasia) is sometimes classified as a type of noninvasive breast cancer. It does not 
30 penetrate through the wall of the lobules. Although it does not itself usually become an 
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invasive cancer, women with this condition have a higher risk of developing an invasive 
breast cancer in the same breast, or in the opposite breast. 

Infiltrating (or invasive) lobular carcinoma (ILC): ILC is similar to IDC, in that it has 
the potential metastasize elsewhere in the body. About 10% to 15% of invasive breast 
5 cancers are invasive lobular carcinomas. ILC can be more difficult to detect by 
mammogram than IDC. 

Inflammatory breast cancer: This rare type of invasive breast cancer accounts for 
about 1% of all breast cancers and is extremely aggressive. Multiple skin symptoms 
associated with this cancer are caused by cancer cells blocking lymph vessels or channels in 
10 the skin over the breast. 

Medullary carcinoma: This special type of infiltrating breast cancer has a relatively 
well defined, distinct boundary between tumor tissue and normal tissue. It accounts for 
about 5% of breast cancers. The prognosis for this kind of breast cancer is better than for 
other types of invasive breast cancer. 
1 5 Mucinous carcinoma: This rare type of invasive breast cancer originates from mucus- 

producing cells. The prognosis for mucinous carcinoma is better than for the more common 
types of invasive breast cancer. 

Paget' s disease of the nipple: This type of breast cancer starts in the ducts and spreads 
to the skin of the nipple and the areola. It is a rare type of breast cancer, occurring in only 
20 1% of all cases. Paget' s disease can be associated with in situ carcinoma, or with infiltrating 
breast carcinoma. If no lump can be felt in the breast tissue, and the biopsy shows DCIS but 
no invasive cancer, the prognosis is excellent. 

Phyllodes tumor: This very rare type of breast tumor forms from the stroma of the 
breast, in contrast to carcinomas which develop in the ducts or lobules. Phyllodes (also 
25 spelled phylloides) tumors are usually benign, but are malignant on rare occasions. 

Nevertheless, malignant phyllodes tumors are very rare and less than 10 women per year in 
the US die of this disease. Benign phyllodes tumors are successfully treated by removing the 
mass and a narrow margin of normal breast tissue. 



30 carcinomas are a special type of infiltrating breast carcinoma. They have a better prognosis 
than usual infiltrating ductal or lobularcarcinomas. 



Tubular carcinoma: Accounting for about 2% of all breast cancers, tubular 
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High-quality mammography combined with clinical breast exam remains the only 
screening method clearly tied to reduction in breast cancer mortality. Lower dose x-rays, 
digitized computer rather than film images, and the use of computer programs to assist 
diagnosis, are almost ready for widespread dissemination. Other technologies also are being 
5 developed, including magnetic resonance imaging and ultrasound. In addition, a very low 
radiation exposure technique, positron emission tomography has the potential for detecting 
early breast cancer. 

It is also possible to differentiate between non-cancerous breast tissue and malignant 
breast tissue by analyzing differential gene expression between tissues. In addition, there 

10 may be several possible alterations that lead to the various possible types of breast cancer. 
The different types of breast tumors (e.g., invasive vs. non-invasive, ductal vs. axillary 
lymph node) can be differentiable from one another by the identification of the differences in 
genes expressed by different types of breast tumor tissues (Porter- Jordan et aL, Hematol 
Oncol Clin North Am (1994) 8:73). Breast cancer can thus be generally diagnosed by 

1 5 detection of expression of a gene or genes associated with breast tumors. Where enough 
information is available about the differential gene expression between various types of 
breast tumor tissues, the specific type of breast tumor can also be diagnosed. 

For example, increased estrogen receptor (ER) expression in normal breast 
epithileum, while not itself indicative of malignant tissue, is a known risk marker for 

20 development of breast cancer. Khan SA et al. 9 Cancer Res (1994) J4:993. Malignant breast 
cancer is often divided into two groups, ER-positive and ER-negative, based on the estrogen 
receptor status of the tissue. The ER status represents different survival length and response 
to hormone therapy, and is thought to represent either: 1 ) an indicator of different stages of 
the disease, or 2) an indicator that allows differentiation between two similar but distinct 

25 diseases. K. Zhu et al, Med. Hypoth. (1997) 49:69. A number of other genes are known to 
vary expression between either different stages of cancer or different types of similar breast 
cancer. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis and management of breast cancer. The differential expression of a polynucleotide 
30 in human breast tumor tissue can be used as a diagnostic marker for human breast cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
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those that exhibit differential expression between breast cancer tissue with a high metastatic 

potential and a low metastatic potential, i.e. SEQ ID NOS: 9, 42, 52, 62, 65, 66, 68, 1 14, 

123, 144, 172, 178, 214, 219, 223, 258, 317, and 379. Detection of breast cancer can be 

determined using expression levels of any of these sequences alone or in combination. 

5 Determination of the aggressive nature and/or the metastatic potential of a breast cancer can 

also be determined by comparing levels of one or more polynucleotides of the invention and 

comparing levels of another sequence known to vary in cancerous tissue, e.g. ER expression. 

In addition, development of breast cancer can be detected by examining the ratio of SEQ ID 

NO: to the levels of steroid hormones {e.g., testosterone or estrogen) or to other hormones 

10 {e.g. , growth hormone, insulin). Thus expression of specific marker polynucleotides can 
be used to discriminate between normal and cancerous breast tissue, to discriminate between 
breast cancers with different cells of origin, to discriminate between breast cancers with 
different potential metastatic rates, etc. 

Diagnosis of breast cancer can also involve comparing the expression of a 

15 polynucleotide of the invention with the expression of other sequences in non-malignant 
breast tissue samples in comparison to one or more forms of the diseased tissue. A 
comparison of expression of one or more polynucleotides of the invention between the 
samples provides information on relative levels of these polynucleotides as well as the ratio 
of these polynucleotides to the expression of other sequences in the tissue of interest 

20 compared to normal. 

This risk of breast cancer is elevated significantly by the presence of an inherited risk 
for breast cancer, such as a mutation in BRCA-1 or BRCA-2. New diagnostic tools are 
being developed to address the needs of higher risk patients to complement mammography 
and physical examinations for early detection of breast cancer, particularly among younger 

25 women. The presence of antigen or expression markers in nipple aspirate fluid (NAF) 

samples collected from one or both breasts can be useful for useful for risk assessment or 
early cancer detection. Breast cytology and biomarkers obtained by random fine needle 
aspiration have been used to identify hyperplasia with atypia and overexpression of p53 and 
EGFR. The polynucleotides of the invention can be used in multivariate analysis with 

30 expression studies with genes such as p53 and EGFR as risk predictors and as surrogate 
endpoint biomarkers for breast cancer. 
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As well as being used for diagnosis and risk assessment, the expression of certain 
genes can also correlated to prognosis of a disease state. The expression of particular gene 
have been used as prognostic indicators for breast cancer including increased expression of 
c-erbB-2, pS2, ER, progesterone receptor, epidermal growth factor receptor (EGFR), neu, 
5 myc, bcl-2, int2, cytosolic tyrosine kinase, cyclin E,prad-J, hst, uPA, PAI-1, PAI-2, 

cathepsin D, as well as the presence of a number of cancer-specific antigens, e.g. CEA, CA 
M26, CA M29 and CA 15.3. Davis, Br. J. BiomedSci. (1996) 53:157. Poor prognosis has 
also been linked to a decrease in expression of certain genes, such as p53, Rb, nm23. The 
expression of the polynucleotides of the invention can be of prognostic value for determining 

10 the metastatic potential of a malignant breast cancer, as this molecules are differentially 
expressed between high and low metastatic potential tissues tumors. The levels of these 
polynucleotides in patients with malignant breast cancer can compared to normal tissue, 
malignant tissue with a known high potential metastatic level, and malignant tissue with a 
known lower level of metastatic potential to provide a prognosis for a particular patient. 

1 5 Such a prognosis is predictive of the extent and nature of the cancer. The determined 

prognosis is useful in determining the prognosis of a patient with breast cancer, both for 
initial treatment of the disease and for longer-term monitoring of the same patient. If 
samples are taken from the same individual over a period of time, differences in 
polynucleotide expression that are specific to that patient can be identified and closely 

20 watched. 

Colon Cancer. Colorectal cancer is one of the most common neoplasms in humans 
and perhaps the most frequent form of hereditary neoplasia. Prevention and early detection 
are key factors in controlling and curing colorectal cancer. Indeed, colorectal cancer is the 
second most preventable cancer, after lung cancer. Colorectal cancer begins as polyps, 

25 which are small, benign growths of cells that form on the inner lining of the colon. Over a 
period of several years, some of these polyps accumulate additional mutations and become 
cancerous. About 20 percent of all cases of colon cancer are thought to be related to 
heredity. Currently, multiple familial colorectal cancer disorders have been identified, which 
are summarized as follows: 

30 Familial adenomatous polyposis (FAP): This condition results in a person having 

hundreds or even thousands of polyps in the colon and rectum that usually first appear during 
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the teenage years. Cancer nearly always develops in one or more of these polyps between 
the ages of 30 and 50. 

Gardner's syndrome: Like FAP, Gardner's syndrome results in polyps and colorectal 
cancers that develop at a young age. It can also cause benign tumors of the skin, soft 
5 connective tissue and bones. 

Hereditary nonpolyposis colon cancer (HNPCC): People with this condition tend to 
develop colorectal cancer at a young age, without first having many polyps. HNPCC has an 
autosomal dominant pattern of inheritance with variable but high penetrance estimated to be 
about 90%. HNPCC underlies 0.5%- 10% of all cases of colorectal cancer. An understanding 
1 0 of the mechanisms behind the development of HNPCC is emerging, and genetic 

presymptomatic testing, now being conducted in research settings, soon will be available on 
a widespread basis for individuals identified at risk for this disease. 

Familial colorectal cancer in Ashkenazi Jews: Recent research has found an inherited 
tendency to developing colorectal cancer among some Jews of Eastern European descent. 
1 5 Like people with FAP, Gardner's syndrome, and HNPCC, their increased risk is due to an 
inherited mutation present in about 6% of American Jews. 

Several tests are currently used to screen for colorectal cancer, including digital rectal 
examination, fecal occult blood test, sigmoidoscopy, colonoscopy, virtual colonoscopy and 
MRI. Each of these tests identifies potential colorectal cancer lesions, or a risk of 
20 development of these lesions, at a fairly gross morphological level. 

The sequential alteration of a number of genes is associated with malignant 
adenocarcinoma, including the genes DCC, p53, ras, and FAP. For a review, see e.g. Fearon 
ER, et aL, Cell (1990) 61(5):759; Hamilton SR et al. 9 Cancer (1993) 72:957; Bodmer W, et 
al 9 Nat Genet. (1994) 4(3):2\1\ Fearon ER, Ann N YAcadSci. (1995) 768:101. Molecular 
25 genetic alterations are thus promising as potential diagnostic and prognostic indicators in 
colorectal carcinoma and molecular genetics of colorectal carcinoma since it is possible to 
differentiate between different types of colorectal neoplasias using molecular markers. 
Colorectal cancer can thus be generally diagnosed by detection of expression of a gene or 
genes associated with colorectal tumors. 
30 Similarly, the expression of polynucleotides of the invention can be used in the 

diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
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polynucleotide in hyperplasia can be used as a diagnostic marker for colon cancer. The 
polynucleotides of the invention that would be especially useful for this purpose are those 
that exhibit differential expression between malignant metastatic colon cancer and normal 
patient tissue , i.e. SEQ ID NOS: 52, 119, 172, 288. Detection of malignant colon cancer 
5 can be determined using expression levels of any of these sequences alone or in combination 
with the levels of expression. 

Determination of the aggressive nature and/or the metastatic potential of a colon 
cancer can also be determined by comparing levels of one or more polynucleotides of the 
invention and comparing total levels of another sequence known to vary in cancerous tissue, 

10 e.g. p53 expression. In addition, development of colon cancer can be detected by examining 
the ratio of any of the polynucleotides of the invention to the levels of oncogenes (e.g. ras) 
or tumor suppressor genes (e.g. FAP or p53). Thus expression of specific marker 
polynucleotides can be used to discriminate between normal and cancerous breast tissue, to 
discriminate between breast cancers with different cells of origin, to discriminate between 

15 breast cancers with different potential metastatic rates, etc. 

G. Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 
Polypeptides encoded by the instant polynucleotides and corresponding full length 
genes can be used to screen peptide libraries to identify binding partners, such as receptors, 
from among the encoded polypeptides. 

20 A library of peptides can be synthesized following the methods disclosed in U.S. Pat. 

No. 5,010,175 0175), and in WO 91/17823. As described below in brief, one prepares a 
mixture of peptides, which is then screened to identify the peptides exhibiting the desired 
signal transduction and receptor binding activity. In the '175 method, a suitable peptide 
synthesis support (e.g., a resin) is coupled to a mixture of appropriately protected, activated 

25 amino acids. The concentration of each amino acid in the reaction mixture is balanced or 

adjusted in inverse proportion to its coupling reaction rate so that the product is an equimolar 
mixture of amino acids coupled to the starting resin. The bound amino acids are then 
deprotected, and reacted with another balanced amino acid mixture to form an equimolar 
mixture of all possible dipeptides. This process is repeated until a mixture of peptides of the 

30 desired length (e.g., hexamers) is formed. Note that one need not include all amino acids in 
each step: one can include only one or two amino acids in some steps (e.g., where it is 
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known that a particular amino acid is essential in a given position), thus reducing the 
complexity of the mixture. After the synthesis of the peptide library is completed, the 
mixture of peptides is screened for binding to the selected polypeptide. The peptides are 
then tested for their ability to inhibit or enhance activity. Peptides exhibiting the desired 
5 activity are then isolated and sequenced. 

The method described in WO 91/17823 is similar. However, instead of reacting the 
synthesis resin with a mixture of activated amino acids, the resin is divided into twenty equal 
portions (or into a number of portions corresponding to the number of different amino acids 
to be added in that step), and each amino acid is coupled individually to its portion of resin. 
10 The resin portions are then combined, mixed, and again divided into a number of equal 

portions for reaction with the second amino acid. In this manner, each reaction can be easily 
driven to completion. Additionally, one can maintain separate "subpools" by treating 
portions in parallel, rather than combining all resins at each step. This simplifies the process 
of determining which peptides are responsible for any observed receptor binding or signal 
1 5 transduction activity. 

In such cases, the subpools containing, e.g., 1-2,000 candidates each are exposed to 
one or more polypeptides of the invention. Each subpool that produces a positive result is 
then resynthesized as a group of smaller subpools (sub-subpools) containing, e.g., 20-100 
candidates, and reassayed. Positive sub-subpools can be resynthesized as individual 
20 compounds, and assayed finally to determine the peptides that exhibit a high binding 
constant. These peptides can be tested for their ability to inhibit or enhance the native 
activity. The methods described in WO 91/7823 and U.S. Patent No. 5,194,392 (herein 
incorporated by reference) enable the preparation of such pools and subpools by automated 
techniques in parallel, such that all synthesis and resynthesis can be performed in a matter of 



Peptide agonists or antagonists are screened using any available method, such as 
signal transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis 
assays, etc. The methods described herein are presently preferred. The assay conditions 
ideally should resemble the conditions under which the native activity is exhibited in vivo, 
30 that is, under physiologic pH, temperature, and ionic strength. Suitable agonists or 
antagonists will exhibit strong inhibition or enhancement of the native activity at 



25 



days. 



74 



BNSDOCID: <WO 9933982A2_I_> 



• 



PCT/US98/27610 



WO 99/33982 



concentrations that do not cause toxic side effects in the subject. Agonists or antagonists that 
compete for binding to the native polypeptide can require concentrations equal to or greater 
than the native concentration, while inhibitors capable of binding irreversibly to the 
polypeptide can be added in concentrations on the order of the native concentration. 



polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA corresponding 
to a polynucleotide of the invention, and at least one peptide agonist or antagonist of the 
novel binding partner. Such agonists and antagonists can be used to modulate, enhance, or 
inhibit receptor function in cells to which the receptor is native, or in cells that possess the 
10 receptor as a result of genetic engineering. Further, if the novel receptor shares biologically 
important characteristics with a known receptor, information about agonist/antagonist 
binding can facilitate development of improved agonists/antagonists of the known receptor. 



15 polynucleotides of the claimed invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 

20 detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature. The precise effective amount for a subject 
will depend upon the subjects size and health, the nature and extent of the condition, and the 
therapeutics or combination of therapeutics selected for administration. Thus, it is not useful 

25 to specify an exact effective amount in advance. However, the effective amount for a given 
situation is determined by routine experimentation and is within the judgment of the 
clinician. For purposes of the present invention, an effective dose will generally be from 
about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in 
the individual to which it is administered. 

30 A pharmaceutical composition can also contain a pharmaceutical ly acceptable carrier. 

The term "pharmaceutical^ acceptable carrier" refers to a carrier for administration of a 



5 



The end results of such screening and experimentation will be at least one novel 



H. Pharmaceutical Compositions and Therapeutic Uses 
Pharmaceutical compositions can comprise polypeptides, antibodies, or 
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therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which can be 
administered without undue toxicity. Suitable carriers can be large, slowly metabolized 
5 macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers 
are well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
10 organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., NJ. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions can include liquids 
such as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or 
15 emulsifying agents, pH buffering substances, and the like, can also be present in such 

vehicles. Typically, the therapeutic compositions are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection can also be prepared. Liposomes are included within the 
definition of a pharmaceutically acceptable carrier. 
20 Delivery Methods. Once formulated, the compositions of the invention can be 

(1) administered directly to the subject (e.g., as polynucleotide or polypeptides); (2) 
delivered ex vivo, to cells derived from the subject (e.g., as in ex vivo gene therapy); or (3) 
delivered in vitro for expression of recombinant proteins (e.g., polynucleotides). Direct 
delivery of the compositions will generally be accomplished by injection, either 
25 subcutaneously, intraperitoneal^, intravenously or intramuscularly, or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or 
lesion. Other modes of administration include oral and pulmonary administration, 
suppositories, and transdermal applications, needles, and gene guns or hyposprays. Dosage 
treatment can be a single dose schedule or a multiple dose schedule. 
30 Methods for the ex vivo delivery and reimplantation of transformed cells into a 

subject are known in the art and described in e.g., International Publication No. WO 
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93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells, 
particularly hematopoetic, lymph cells, macrophages, dendritic cells, or tumor cells. 
Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by, for example, dextran-mediated transfection, calcium phosphate 



encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei, all well known in the art. 

Once a gene corresponding to a polynucleotide of the invention has been found to 
correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the 

10 disorder can be amenable to treatment by administration of a therapeutic agent based on the 
provided polynucleotide or corresponding polypeptide. 

Preparation of antisense polynucleotides is discussed above. Neoplasias that are 
treated with the antisense composition include, but are not limited to, cervical cancers, 
melanomas, colorectal adenocarcinomas, Wilms' tumor, retinoblastoma, sarcomas, 

1 5 myosarcomas, lung carcinomas, leukemias, such as chronic myelogenous leukemia, 

promyelocytic leukemia, monocytic leukemia, and myeloid leukemia, and lymphomas, such 
as histiocytic lymphoma. Proliferative disorders that are treated with the therapeutic 
composition include disorders such as anhydric hereditary ectodermal dysplasia, congenital 
alveolar dysplasia, epithelial dysplasia of the cervix, fibrous dysplasia of bone, and 

20 mammary dysplasia. Hyperplasias, for example, endometrial, adrenal, breast, prostate, or 
thyroid hyperplasias or pseudoepitheliomatous hyperplasia of the skin, are treated with 
antisense therapeutic compositions based upon a polynucleotide of the invention. Even in 
disorders in which mutations in the corresponding gene are not implicated, downregulation 
or inhibition of expression of a gene corresponding to a polynucleotide of the invention can 

25 have therapeutic application. For example, decreasing gene expression can help to suppress 
tumors in which enhanced expression of the gene is implicated. 

Both the dose of the antisense composition and the means of administration are 
determined based on the specific qualities of the therapeutic composition, the condition, age, 
and weight of the patient, the progression of the disease, and other relevant factors. 

30 Administration of the therapeutic antisense agents of the invention includes local or systemic 
administration, including injection, oral administration, particle gun or catheterized 



5 precipitation, polybrene mediated transfection, protoplast fusion, electroporation, 
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administration, and topical administration. Preferably, the therapeutic antisense composition 
contains an expression construct comprising a promoter and a polynucleotide segment of at 
least 12, 22, 25, 30, or 35 contiguous nucleotides of the antisense strand of a polynucleotide 
disclosed herein. Within the expression construct, the polynucleotide segment is located 
5 downstream from the promoter, and transcription of the polynucleotide segment initiates at 
the promoter. 

Various methods are used to administer the therapeutic composition directly to a 
specific site in the body. For example, a small metastatic lesion is located and the 
therapeutic composition injected several times in several different locations within the body 

10 of tumor. Alternatively, arteries which serve a tumor are identified, and the therapeutic 

composition injected into such an artery, in order to deliver the composition directly into the 
tumor. A tumor that has a necrotic center is aspirated and the composition injected directly 
into the now empty center of the tumor. The antisense composition is directly administered 
to the surface of the tumor, for example, by topical application of the composition. X-ray 

1 5 imaging is used to assist in certain of the above delivery methods. 

Receptor-mediated targeted delivery of therapeutic compositions containing an 
antisense polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues is 
also used. Receptor-mediated DNA delivery techniques are described in, for example, 
Findeis et al. y Trends Biotechnol (1993) 11:202; Chiou et aL, Gene Therapeutics: Methods 

20 And Applications Of Direct Gene Transfer (J. A. Wolff, ed.) (1994); Wu et al. 9 J. Biol. Chem. 
(1988) 263:621; Wu et al 9 J. Biol Chem. (1994) 269:542; Zenke et al, Proc. Natl Acad 
ScL (USA) (1990) 57:3655; Wu et ah, J* Biol Chem. (1991) 26633S. Preferably, receptor- 
mediated targeted delivery of therapeutic compositions containing antibodies of the 
invention is used to deliver the antibodies to specific tissue. 

25 Therapeutic compositions containing antisense subgenomic polynucleotides are 

administered in a range of about 100 ng to about 200 mg of DNA for local administration in 
a gene therapy protocol. Concentration ranges of about 500 ng to about 50 mg, about 1 fig to 
about 2 mg, about 5 jag to about 500 fig, and about 20 fag to about 100 fag of DNA can also 
be used during a gene therapy protocol. Factors such as method of action and efficacy of 

30 transformation and expression are considerations which will affect the dosage required for 
ultimate efficacy of the antisense subgenomic polynucleotides. Where greater expression is 
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desired over a larger area of tissue, larger amounts of antisense subgenomic polynucleotides 

or the same amounts readministered in a successive protocol of administrations, or several 

administrations to different adjacent or close tissue portions of, for example, a tumor site, 

may be required to effect a positive therapeutic outcome. In all cases, routine 

5 experimentation in clinical trials will determine specific ranges for optimal therapeutic 

effect. A more complete description of gene therapy vectors, especially retroviral vectors, is 

contained in U.S. Serial No. 08/869,309, which is expressly incorporated herein, and in 

section G below. 

For polynucleotide-related genes encoding polypeptides or proteins with anti- 
10 inflammatory activity, suitable use, doses, and administration are described in U.S. Patent 
No. 5,654,173. Therapeutic agents also include antibodies to proteins and polypeptides 
encoded by the polynucleotides of the invention and related genes, as described in U.S. 
Patent No. 5,654,173. 

I. Gene Therapy 

1 5 The therapeutic polynucleotides and polypeptides of the present invention can be 

utilized in gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral 
origin (see generally, Jolly, Cancer Gene Therapy (1994) 7:51; Kimura, Human Gene 
Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995) 7:185; and Kaplitt, Nature 
Genetics (1994) 6:148). Gene therapy vehicles for delivery of constructs including a coding 

20 sequence of a therapeutic of the invention can be administered either locally or systemically . 
These constructs can utilize viral or non-viral vector approaches. Expression of such coding 
sequences can be induced using endogenous mammalian or heterologous promoters. 
Expression of the coding sequence can be either constitutive or regulated. 

The present invention can employ recombinant retroviruses which are constructed to 

25 carry or express a selected nucleic acid molecule of interest. Retrovirus vectors that can be 
employed include those described in EP 0 415 731; WO 90/07936; WO 94/03622; WO 
93/25698; WO 93/25234; U.S. Patent No. 5, 219,740; WO 93/1 1230; WO 93/10218; Vile 
and Hart, Cancer Res. (1993) 55:3860; Vile et al, Cancer Res. (1993) 53:962; Ram et al., 
Cancer Res. (1993) 53:83; Takamiya et al, J. Neurosci. Res. (1992) 33:493; Baba et al., 

30 Neurosurg. (1993) 79:729; U.S. Patent No. 4,777,127; GB Patent No. 2,200,651; and EP 0 
345 242. Preferred recombinant retroviruses include those described in WO 91/02805. 
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Packaging cell lines suitable for use with the above-described retroviral vector 
constructs can be readily prepared (see, e.g., WO 95/30763 and WO 92/05266), and used to 
create producer cell lines (also termed vector cell lines) for the production of recombinant 
vector particles. Within particularly preferred embodiments of the invention, packaging cell 
5 lines are made from human (such as HT1080 cells) or mink parent cell lines, thereby 
allowing production of recombinant retroviruses that can survive inactivation in human 
serum. 

The present invention also employs alphavirus-based vectors that can function as 
gene delivery vehicles. Such vectors can be constructed from a wide variety of alphaviruses, 

10 including, for example, Sindbis virus vectors, Semliki forest virus (ATCC VR-67; ATCC 
VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan equine 
encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR 1249; ATCC VR-532). 
Representative examples of such vector systems include those described in U.S. Patent Nos. 
5,091,309; 5,217,879; and 5,185,440; WO 92/10578; WO 94/21792; WO 95/27069; WO 

1 5 95/27044; and WO 95/07994. Gene delivery vehicles of the present invention can also 

employ parvovirus such as adeno-associated virus (AAV) vectors. Representative examples 
include the AAV vectors disclosed by Srivastava in WO 93/09239, Samulski et al., J. Virol. 
(1989) 65:3822; Mendelson et al„ Virol (1988) 766:154; and Flotte et al, PNAS (1993) 
90:10613. 

20 Representative examples of adenoviral vectors include those described by Berkner, 

Biotechniques (1988) 6:616; Rosenfeld et aL, Science (1991) 252:431; WO 93/19191; Kolls 
et al, PNAS (1994) P7:215; Kass-Eisler et aL, PNAS (1993) 90: 1 1498; Guzman et al, 
Circulation (1993) 55:2838; Guzman et aL, Cir. Res. (1993) 73:1202; Zabner et aL 9 Cell 
(1993) 75:207; Li et aL, Hum. Gene Ther. (1993) 4:403; Cailaud et aL, Eur. J. NeuroscL 

25 (1993) 5:1287; Vincent et al, Nat. Genet. (1993) 5:130; Jaffe et aL, Nat. Genet. (1992) 

7:372; and Levrero et al, Gene (1991) 707:195. Exemplary adenoviral gene therapy vectors 
employable in this invention also include those described in WO 94/12649, WO 93/03769; 
WO 93/19191; WO 94/28938; WO 95/1 1984 and WO 95/00655. Administration of DNA 
linked to killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 3:147 can be 

30 employed. 
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Other gene delivery vehicles and methods can be employed, including polycationic 
condensed DNA linked or unlinked to killed adenovirus alone, for example Curieh Hum. 
Gene Ther. (1992) 3:147; ligand linked DNA, for example see Wu, J. Biol. Chem. (1989) 
264:16985; eukaryotic cell delivery vehicles cells, for example see U.S. Pat. No. 5,814,482; 
5 WO 95/07994; WO 96/17072; WO 95/30763; and WO 97/42338; deposition of 

photopolymerized hydrogel materials; hand-held gene transfer particle gun, as described in 
U.S. Patent No. 5,149,655; ionizing radiation as described in U.S. Patent No. 5,206,152 and 
in W092/1 1033; nucleic charge neutralization or fusion with cell membranes. Additional 
approaches are described in Philip, Mol Cell Biol (1994) 14:2AM, and in Woffendin, Proc. 

1 0 Natl Acad. Sci. ( 1 994) 91 : 1 58 1 . 

Naked DNA can also be employed. Exemplary naked DNA introduction methods are 
described in WO 90/1 1092 and U.S. Patent No. 5,580,859. Uptake efficiency can be 
improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method can be improved 

15 further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. Liposomes that can act as gene 
delivery vehicles are described in U.S. Patent No. 5,422,120; WO 95/13796; WO 94/23697; 
WO 91/14445; and EP 0524968. 

Further non- viral delivery suitable for use includes mechanical delivery systems such 

20 as the approach described in Woffendin et al, Proc. Natl Acad. Sci. USA (1994) 

P/(24):l 1581. Moreover, the coding sequence and the product of expression of such can be 
delivered through deposition of photopolymerized hydrogel materials. Other conventional 
methods for gene delivery that can be used for delivery of the coding sequence include, for 
example, use of hand-held gene transfer particle gun, as described in U.S. Patent No. 

25 5,149,655; use of ionizing radiation for activating transferred gene, as described in U.S. 
Patent No. 5,206,152 and WO 92/11033. 

The present invention will now be illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, it should be noted that 
these embodiments are illustrative and are not to be construed as restricting the invention in 

30 any way. 
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EXAMPLES 



The present invention is now illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, these embodiments are 
illustrative and are not meant to be construed as restricting the invention in any way. 

5 

Example 1 : Source of Biological Materials and Overview of Novel Polynucleotides 
Expressed by the Biological Materials 
Human colon cancer cell line Kml2L4-A (Morika, W. A. K. et al., Cancer Research 
(1988) 45:6863) was used to construct a cDNA library from mRNA isolated from the cells. 

10 As described in the above overview, a total of 4,693 sequences expressed by the Kml2L4-A 
cell line were isolated and analyzed; most sequences were about 275-300 nucleotides in 
length. The KM12L4-A cell line is derived from the KM12C cell line. The KM12C cell 
line, which is poorly metastatic (low metastatic) was established in culture from a Dukes' 
stage B 2 surgical specimen (Morikawa et al Cancer Res. (1988) 48:6863). The KML4-A is 

15 a highly metastatic subline derived from KM12C (Yeatman et al Nucl Acids. Res. (1995) 
23:4007; Bao-Ling et al Proc. Annu. Meet. Am. Assoc. Cancer. Res. (1995) 27:3269). The 
KM12C and KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are well- 
recognized in the art as a model cell line for the study of colon cancer (see, e.g., Moriakawa 
et al, supra; Radinsky et al Clin. Cancer Res. (1995) 7:19; Yeatman et al, (1995) supra; 

20 Yeatman et al Clin. Exp. Metastasis (1996) 74:246). 

The sequences were first masked to eliminate low complexity sequences using the XBLAST 
masking program (Claverie "Effective Large-Scale Sequence Similarity Searches," In: 
Computer Methods for Macromolecular Sequence Analysis . Doolittle, ed., Meth. Enzymol 
266:212-227 Academic Press, NY, NY (1996); see particularly Claverie, in "Automated 

25 DNA Sequencing and Analysis Techniques" Adams et al, eds., Chap. 36, p. 267 Academic 
Press, San Diego, 1 994 and Claverie et al Comput. Chem. (1993) 17:191 ). Generally, 
masking does not influence the final search results, except to eliminate of relative little 
interest due to their lox complexity, and to eliminate multiple "hits" based on similarity to 
repetitive regions common to multiple sequences, e.g., Alu repeats. Masking resulted in the 

30 elimination of 43 sequences. The remaining sequences were then used in a BLASTN vs. 
Genbank search with search parameters of greater than 70% overlap, 99% identity, and a p 
value of less than 1 x 10" 40 , which search resulted in the discarding of 1,432 sequences. 
Sequences from this search also were discarded if the inclusive parameters were met, but the 
sequence was ribosomal or vector-derived. 
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The resulting sequences from the previous search were classified into three groups 

(1, 2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database 

search: (1) unknown (no hits in the Genbank search), (2) weak similarity (greater than 45% 

identity and p value of less than 1 x 10" 5 ), and (3) high similarity (greater than 60% overlap, 

5 greater than 80% identity, and p value less than 1 x 10" 5 ). This search resulted in discard of 

98 sequences as having greater than 70% overlap, greater than 99% identity, and p value of 

less than 1 x 10" 40 . 

The remaining sequences were classified as unknown (no hits), weak similarity, and 
high similarity (parameters as above). Two searches were performed on these sequences. 

10 First, a BLAST vs. EST database search resulted in discard of 1771 sequences (sequences 
with greater than 99% overlap, greater than 99% similarity and a p value of less than 1x10" 
40 ; sequences with a p value of less than 1 x 1 0" 65 when compared to a database sequence of 
human origin were also excluded). Second, a BLASTN vs. Patent GeneSeq database 
resulted in discard of 15 sequences (greater than 99% identity; p value less than 1 x 1 0" 40 ; 

1 5 greater than 99% overlap). 

The remaining sequences were subjected to screening using other rules and 
redundancies in the dataset. Sequences with a p value of less than 1 x 1 0 _1 11 in relation to a 
database sequence of human origin were specifically excluded. The final result provided the 
404 sequences listed in the accompanying Sequence Listing. The Sequence Listing is 

20 arranged beginning with sequences with no similarity to any sequence in a database 
searched, and ending with sequences with the greatest similarity. Each identified 
polynucleotide represents sequence from at least a partial mRNA transcript. Polynucleotides 
that were determined to be novel were assigned a sequence identification number. 

The novel polynucleotides and were assigned sequence identification numbers SEQ 

25 ID NOS: 1-404. The DNA sequences corresponding to the novel polynucleotides are 
provided in the Sequence Listing. The majority of the sequences are presented in the 
Sequence Listing in the 5' to 3' direction. A small number, 25, are listed in the Sequence 
Listing in the 5' to 3' direction but the sequence as written is actually V to 5\ These 
sequences are readily identified with the designation "AR" in the Sequence Name in Table 1 

30 (inserted before the claims). The sequences correctly listed in the 5' to 3' direction in the 
Sequence Listing are designated "AF." The Sequence Listing filed herewith therefore 
contains 25 sequences listed in the reverse order, namely SEQ ID NOS:47, 97, 137, 171, 
173, 179, 182, 194, 200, 202, 213, 227, 258, 264, 275, 302, 313, 324, 329, 330, 331, 338, 
358, 379, and 404. 
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Because the provided polynucleotides represent partial mRNA transcripts, two or 
more polynucleotides of the invention may represent different regions of the same mRNA 
transcript and the same gene. Thus, if two or more SEQ ID NOS: are identified as belonging 
to the same clone, then either sequence can be used to obtain the full-length mRNA or gene. 
5 In order to confirm the sequences of SEQ ID NOS: 1 -404, inserts of the clones 

corresponding to these polynucleotides were re-sequenced. These "validation" sequences 
are provided in SEQ ID NOS:405-800. These validation sequences were often longer than 
the original polynucleotide sequences. They validate, and thus often provide additional 
sequence information. Validation sequences can be correlated with the original sequences 
1 0 they validate by identifying those sequences of SEQ ID NOS: 1 -404 and the validation 
sequences of SEQ ID NOS:405-800 that share the same clone name in Table 1 . 

Example 2: Results of Public Database Search to Identify Function of Gene Products 



1 5 translated in all three reading frames to determine the best alignment with the individual 

sequences. These amino acid sequences and nucleotide sequences are referred, generally, as 
query sequences, which are aligned with the individual sequences. Query and individual 
sequences were aligned using the BLAST programs, available over the world wide web at 
http://ww. ncbi.nlm.nih.gov/BLAST/ . Again the sequences were masked to various extents 

20 to prevent searching of repetitive sequences or poly-A sequences, using the XBLAST 
program for masking low complexity as described above in Example 1 . 

Table 2 (inserted before the claims) shows the results of the alignments. Table 2 
refers to each sequence by its SEQ ID NO:, the accession numbers and descriptions of 
nearest neighbors from the Genbank and Non-Redundant Protein searches, and the p values 

25 of the search results. Table 1 identifies each SEQ ID NO: by SEQ name, clone ID, and 

cluster. As discussed above, a single cluster includes polynucleotides representing the same 
gene or gene family, and generally represents sequences encoding the same gene product. 

For each of SEQ ID NOS: 1-800, the best alignment to a protein or DNA sequence is 
included in Table 2. The activity of the polypeptide encoded by SEQ ID NOS: 1 -800 is the 

30 same or similar to the nearest neighbor reported in Table 2. The accession number of the 
nearest neighbor is reported, providing a reference to the activities exhibited by the nearest 
neighbor. The search program and database used for the alignment also are indicated as well 
as a calculation of the p value. 



SEQ ID NOS: 1-404, as well as the validation sequences SEQ ID NOS:405-800, were 
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Full length sequences or fragments of the polynucleotide sequences of the nearest 

neighbors can be used as probes and primers to identify and isolate the full length sequence 

of SEQ ID NOS: 1-800. The nearest neighbors can indicate a tissue or cell type to be used to 

construct a library for the full-length sequences of SEQ ID NOS: 1-800. 

5 SEQ ID NOS: 1 -800 and the translations thereof may be human homologs of known 

genes of other species or novel allelic variants of known human genes. In such cases, these 

new human sequences are suitable as diagnostics or therapeutics. As diagnostics, the human 

sequences SEQ ID NOS: 1-800 exhibit greater specificity in detecting and differentiating 

human cell lines and types than homologs of other species. The human polypeptides 

1 0 encoded by SEQ ID NOS: 1-800 are likely to be less immunogenic when administered to 

humans than homologs from other species. Further, on administration to humans, the 

polypeptides encoded by SEQ ID NOS: 1-800 can show greater specificity or can be better 

regulated by other human proteins than are homologs from other species. 



15 Example 3 : Members of Protein Families 

After conducting a profile search as described in the specification above, several of the 
polynucleotides of the invention were found to encode polypeptides having characteristics of 
a polypeptide belonging to a known protein families (and thus represent new members of 
these protein families) and/or comprising a known functional domain (Table 3). Thus the 

20 invention encompasses fragments, fusions, and variants of such polynucleotides that retain 
biological activity associated with the protein family and/or functional domain identified 
herein. 

Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQ ID 

NO: 


Biological Activity (Profile hit) 


Start 


Stop 


Dir 


24 


4 transmembrane segments integral membrane proteins 


1218 


578 


rev 


41 


4 transmembrane segments integral membrane proteins 


1086 


413 


rev 


101 


4 transmembrane segments integral membrane proteins 


1206 


544 


rev 


157 


4 transmembrane segments integral membrane proteins 


721 


33 


rev 


341 


4 transmembrane segments integral membrane proteins 


1253 


613 


rev 


395 


4 transmembrane segments integral membrane proteins 


530 


10 


for 


395 


4 transmembrane segments integral membrane proteins 


696 


17 


for 


395 


4 transmembrane segments integral membrane proteins 


471 


39 


rev 


24 


7 transmembrane receptor (Secretin family) 


1301 


491 


rev 


41 


7 transmembrane receptor (Secretin family) 


1309 


10 


rev 


101 


7 transmembrane receptor (Secretin family) 


1330 


296 


rev 


157 


7 transmembrane receptor (Secretin family) 


1173 


249 


rev 


291 


7 transmembrane receptor (Secretin family) 


1400 


269 


rev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 
functional domain(s). 



SEQ ID 

NO: 


Biological Activity (Profile hit) 


Start 


Stnn 


Hir 

JL*1 J 


291 


7 transmembrane receptor (Secretin family) 


712 


130 


for 


305 


7 transmembrane receptor (Secretin family) 


926 


4 


for 


305 


7 transmembrane receptor (Secretin family) 


753 


55 


rev 


315 


7 transmembrane receptor (Secretin family) 


1058 


270 


rev 


341 


7 transmembrane receptor (Secretin family) 


1265 


534 


rev 


116 


Ank repeat 


141 


218 


for 


251 


Ank repeat 


290 


207 


for 


251 


Ank repeat 


467 


387 


for 


63 


ATPases Associated with Various Cellular Activities 




ou 


ror 


116 


ATPases Associated with Various Cellular Activities 


802 


313 


for 


134 


ATPases Associated with Various Cellular Activities 


525 


57 


rev 


136 


ATPases Associated with Various Cellular Activities 


712 


163 


for 


151 


ATPases Associated with Various Cellular Activities 


719 


73 


for 


151 


ATPases Associated with Various Cellular Activities 


386 


13 


for 


384 


ATPases Associated with Various Cellular Activities 


664 


140 


for 


404 


ATPases Associated with Various Cellular Activities 


704 


52 


for 


374 jBasic region plus leucine zipper transcription factors 


298 


146 


for 


97 


Bromodomain (conserved sequence found in human, 
Drosophila and yeast proteins.) 


230 


63 


for 


136 


EF-hand 


121 


207 


for 


242 


EF-hand 


238 


155 


for 


379 


EF-hand 


212 


126 


for 


308 |Eukaryotic aspartyl proteases 


1300 


461 


rev 


2 1 3 jGATA family of transcription factors 


720 


377 


for 


367 (G-protein alpha subunit 


971 


467 


rev 


188 


Phorbol esters/diacylglycerol binding 


91 


177 


for 


251 


Phorbol esters/diacylglycerol binding 


133 


219 


for 


202 


protein kinase 


482 


1 


rev 


202 


protein kinase 


970 


1 


rev 


315 


protein kinase 


739 


158 


for 


315 


protein kinase 


1023 


197 


for 


367 


protein kinase 


1046 


285 


rev 


397 


protein kinase 


511 


6 


for 


256 


Protein phosphatase 2C 


13 


90 


for 


256 


Protein phosphatase 2C 


163 


86 


for 


382 | Protein Tyrosine Phosphatase 


261 


2 


for 


306 


SH3 Domain 


141 


296 


for 


386 


SH3 Domain 


359 


209 


for 


169 


Trypsin 


764 


164 


rev 


188 


WD domain, G-beta repeats 


480 


382 


for 


188 


WD domain, G-beta repeats 


206 


117 


for 


335 


WD domain, G-beta repeats 


3 < 


?2 


for 


23 


wnt family of developmental signaling proteins 


1151 


335 i 


rev 


291 


wnt family of developmental signaling proteins 


779 i 


S9 i 


r ev 


291 


wnt family of developmental signaling proteins 


1347 : 


*82 i 


*ev 


324 


wnt family of developmental signaling proteins 


1180 < 


t99 i 


*ev 


330 


wnt family of developmental signaling proteins 


1180 


199 l 


■ev 


341 


wnt family of developmental signaling proteins 


1399 ; 


560 i 


ev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 





Rinlnair al Activitv (Profile hit) 


Start 


Stop 


Dir 




wnt family of developmental signaling proteins 


880 


49 


rev 1 


188 


WW/rsp5/WWP domain containing proteins 


431 


354 


for_ 


379 


WW/rsp5/WWP domain containing proteins 


12 


89 


for 


395 


WW/rsp5/WWP domain containing proteins 


153 


76 


for 


395 


WW/rsp5/WWP domain containing proteins 


156 


64 


for 


61 


Zinc finger, C2H2 type 


254 


192 


for 


306 


Zinc finger, C2H2 type 


428 


367 


for 


386 iZinc finger. C2H2 type 


191 


253 


for 


322 IZinc finger, CCHC class 


553 


503 


for 


306 


Zinc-binding metalloprotease domain 


101 


60 


rev 


395 


Zinc-binding metalloprotease domain 


28 


69 


rev 



Start and stop indicate the position within the individual sequenes that align with the 
query sequence having the indicated SEQ ID NO. The direction (Dir) indicates the 
orientation of the query sequence with respect to the individual sequence, where forward 
5 (for) indicates that the alignment is in the same direction (left to right) as the sequence 
provided in the Sequence Listing and reverse (rev) indicates that the alignment is with a 
sequence complementary to the sequence provided in the Sequence Listing. 



Some polynucleotides exhibited multiple profile hits because, for example, the particular 
10 sequence contains overlapping profile regions, and/or the sequence contains two different 
functional domains. These profile hits are described in more detail below. 

a) Four Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101, 
157, 341, and 395 correspond to a sequence encoding a polypeptide that is a member of the 4 
transmembrane segments integral membrane protein family (transmembrane 4 family). The 
1 5 transmembrane 4 family of proteins includes a number of evolutionarily-related eukaryotic 
cell surface antigens (Levy et al 9 J. Biol Chem., (1991) 266:14597; Tomlinson et al, Eur. J. 
Immunol (1993) 2J:136; Barclay et al The leucocyte antigen factbooks. (1993) Academic 
Press, London/San Diego). The proteins belonging to this family include: 1) Mammalian 
antigen CD9 (MIC3), which is involved in platelet activation and aggregation; 2) 
20 Mammalian leukocyte antigen CD37, expressed on B lymphocytes; 3) Mammalian 

leukocyte antigen CD53 (OX-44), which is implicated in growth regulation in hematopoietic 
cells; 4) Mammalian lysosomal membrane protein CD63 (melanoma-associated antigen 
ME491; antigen AD1); 5) Mammalian antigen CD81 (cell surface protein TAPA-1), which 
is implicated in regulation of lymphoma cell growth; 6) Mammalian antigen CD82 (protein 
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R2; antigen C33; Kangai 1 (KAI1)), which associates with CD4 or CD8 and delivers 

costimulatory signals for the TCR/CD3 pathway; 7) Mammalian antigen CD151 (SFA-1; 

platelet-endothelial tetraspan antigen 3 (PETA-3)); 8) Mammalian cell surface glycoprotein 

A15 (TALLA-1; MXS1); 9) Mammalian novel antigen 2 (NAG-2); 10) Human tumor- 

5 associated antigen CO-029; 1 1) Schistosoma mansoni and japonicum 23 Kd surface antigen 

(SM23 / SJ23). 

The members of the 4 transmembrane family share several characteristics. First, they 
all are apparently type III membrane proteins, which are integral membrane proteins 
containing an N-terminal membrane-anchoring domain which is not cleaved during 

10 biosynthesis and which functions both as a translocation signal and as a membrane anchor. 
The family members also contain three additional transmembrane regions, at least seven 
conserved cysteines residues, and are of approximately the same size (218 to 284 residues). 
These proteins are collectively know as the "transmembrane 4 superfamily" (TM4) because 
they span plasma membrane four times. A schematic diagram of the domain structure of 

1 5 these proteins is as follows: 

-K+ + +- — + + + + + + 

1 1 TMa | Extra | TM2| Cyt | TM3 | Extracellular | TM4 | Cyt| 

+-+ + + — c — c — + cc— c- — c— +- — c — + 

********* 

20 where Cyt is the cytoplasmic domain, TMa is the transmembrane anchor; TM2 to TM4 
represents transmembrane regions 2 to 4, 'C are conserved cysteines, and '* indicates the 
position of the consensus pattern. The consensus pattern spans a conserved region including 
two cysteines located in a short cytoplasmic loop between two transmembrane domains: 
Consensus pattern: G-x(3)-[LIVMF]-x(2>[GSA]-[LIVMF](2)-G-C-x-[GA]-[STA]- x(2)- 

25 [EG]-x(2)-[CWNHLIVM](2). 

b) Seven Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101, 
157, 291, 305, 315, and 341 correspond to a sequence encoding a polypeptide that is a 
member of the seven transmembrane receptor family. G-protein coupled receptors 
(Strosberg, Eur. J. Biochem. (1991) 196:1; Kerlavage, Curr. Opin. Struct Biol (1991) 

30 7:394; and Probsts al 9 DNA Cell Biol ( 1 992) 77:1; and Savarese et al , Biochem. J. (1992) 

295:1) (also called R7G) are an extensive group of hormones, neurotransmitters, odorants 

and light receptors which transduce extracellular signals by interaction with guanine 

nucleotide-binding (G) proteins. The tertiary structure of these receptors is thought to be 

highly similar. They have seven hydrophobic regions, each of which most probably spans 
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the membrane. The N-terminus is located on the extracellular side of the membrane and is 
often glycosylated, while the C-terminus is cytoplasmic and generally phosphorylated. 
Three extracellular loops alternate with three intracellular loops to link the seven 
transmembrane regions. Most, but not all of these receptors, lack a signal peptide. The most 
5 conserved parts of these proteins are the transmembrane regions and the first two 

cytoplasmic loops. A conserved acidic-Arg-aromatic triplet is present in the N-terminal 
extremity of the second cytoplasmic loop (Attwood et aL, Gene (1991) 95:153) and could 
be implicated in the interaction with G proteins. 

To detect this widespread family of proteins a pattern is used that contains the 

1 0 conserved triplet and that also spans the major part of the third transmembrane helix. 

Additional information about the seven transmembrane receptor family, and methods for 
their identification and use, is found in U.S. Patent No. 5,759,804. Due in part to their 
expression on the cell surface and other attractive characteristics, seven transmembrane 
protein family members are of particular interest as drug targets, as surface antigen markers, 

15 and as drug delivery targets (e.g., using antibody-drug complexes and/or use of anti-seven 
transmembrane protein antibodies as therapeutics in their own right). 

c) Ank Repeats. SEQ ID NOS: 1 16 and 251 represent polynucleotides encoding Ank 
repeat-containing proteins. The ankyrin motif is a 33 amino acid sequence named after the 
protein ankyrin which has 24 tandem 33-amino-acid motifs. Ank repeats were originally 

20 identified in the cell-cycle-control protein cdclO (Breeden et aL, Nature (1987) 329:651). 

Proteins containing ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell cycle 
protein cdclO, the Notch receptor (Matsuno et aL, Development (1 997) 124(21):4265); G9a 
(or BAT8) of the class III region of the major histocompatibility complex (Biochem J. 
290:811-818, 1993), FABP, GABP, 53BP2, Linl2, glp-1, SW14, and SW16. The functions 

25 of the ankyrin repeats are compatible with a role in protein-protein interactions (Bork, 

Proteins (1993) 1 7(4):363; Lambert and Bennet, Eur. J. Biochem. (1993) 277:1; Kerr et aL, 
Current Op. Cell Biol. (1992) 4:496; Bennet et aL, J. Biol. Chem. (1980) 255:6424). 

The 90 kD N-terminal domain of ankyrin contains a series of 24 33-amino-acid ank 
repeats. (Lux et aL, Nature (1990) 344:36-42, Lambert et aL, PNAS USA (1990) 57:1730.) 

30 The 24 ank repeats form four folded subdomains of 6 repeats each. These four repeat 

subdomains mediate interactions with at least 7 different families of membrane proteins. 

Ankyrin contains two separate binding sites for anion exchanger dimers. One site utilizes 

repeat subdomain two (repeats 7-12) and the other requires both repeat subdomains 3 and 4 

(repeats 13-24). Since the anion exchangers exist in dimers, ankyrin binds 4 anion 
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exchangers at the same time. (Michaely and Bennett, J. Biol Chem. (1995) 2 70(3 7) :22050) 
The repeat motifs are involved in ankyrin interaction with tubulin, spectrin, and other 
membrane proteins. (Lux et al. 9 Nature (1990) 344:36.) 

The Rel/NF-kappaB/Dorsal family of transcription factors have activity that is 
5 controlled by sequestration in the cytoplasm in association with inhibitory proteins referred 
to as I-kappaB. (Gilmore, Cell (1990) 62:841; Nolan and Baltimore, Curr Opin Genet Dev. 
(1992) 2:21 1 ; Baeuerle, Biochim Biophys Acta (1991) 1072:63; Schmitz et a!., Trends Cell 
Biol (1991) 7:130.) I-kappaB proteins contain 5 to 8 copies of 33 amino acid ankyrin 
repeats and certain NF-kappaB/rel proteins are also regulated by cis-acting ankyrin repeat 
10 containing domains including pl05NF-kappaB which contains a series of ankyrin repeats 
(Diehl and Hannink, J. Virol. (1993) 67(12):1\6\). The I-kappaBs and Cactus (also 
containing ankyrin repeats) inhibit activators through differential interactions with the Rel- 
homology domain. The gene family includes proto-oncogenes, thus broadly implicating I- 
kappaB in the control of both normal gene expression and the aberrant gene expression that 
15 makes cells cancerous. (Nolan and Baltimore, Curr Opin Genet Dev. (1992) 2(2):2\ 1-220). 
In the case of rel/NF-kappaB and pp40/I-kappaBp, both the ankyrin repeats and the carboxy- 
terminal domain are required for inhibiting DNA-binding activity and direct association of 
pp40/I-kappaBp with rel/NF-kappaB protein. The ankyrin repeats and the carboxy-terminal 
of pp40/I-kappaBp ( form a structure that associates with the rel homology domain to inhibit 
20 DNA binding activity (Inoue et al. 9 PNAS USA (1992) 59:4333). 

The 4 ankyrin repeats in the amino terminus of the transcription factor subunit 
GABPp are required for its interaction with the GABPa subunit to form a functional high 
affinity DNA-binding protein. These repeats can be crosslinked to DNA when GABP is 
bound to its target sequence. (Thompson et al., Science (1991) 253:762; LaMarco et al. y 
25 Science (1991) 255:789). 

Myotrophin, a 12.5 kDa protein having a key role in the initiation of cardiac 
hypertrophy, comprises ankyrin repeats. The ankyrin repeats are characteristic of a hairpin- 
like protruding tip followed by a helix-turn-helix motif. The V-shaped helix-tum-helix of 
the repeats stack sequentially in bundles and are stabilized by compact hydrophobic cores, 
30 whereas the protruding tips are less ordered. 

d) ATPases Associated with Various Cellular Activities (AAA). SEQIDNOS:63, 
1 16, 134, 136, 151, 384, and 404 polynucleotides encoding novel members of the "ATPases 
Associated with diverse cellular Activities" (AAA) protein family The AAA protein family 
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is composed of a large number of ATPases that share a conserved region of about 220 amino 

acids that contains an ATP-binding site (Froehlich et al, J. Cell Biol (1991) 114:443; 

Erdmann et al Cell (1991) 64A99\ Peters et al, EMBOJ. (1990) P:1757; Kunau et al., 

Biochimie (1993 ) 75:209-224; Confalonieri et al, BioEssays (1995) 77:639; 

5 http://yeamob.pci.chemie.uni-tuebingen.de/AAA/Description.html). The proteins that 

belong to this family either contain one or two AAA domains. 

Proteins containing two AAA domains include: 1) Mammalian and drosophila NSF 

(N-ethylmaleimide-sensitive fusion protein) and the fungal homolog, SEC 18, which are 

involved in intracellular transport between the endoplasmic reticulum and Golgi, as well as 

1 0 between different Golgi cisternae; 2) Mammalian transitional endoplasmic reticulum 

ATPase (previously known as p97 or VCP), which is involved in the transfer of membranes 
from the endoplasmic reticulum to the golgi apparatus. This ATPase forms a ring-shaped 
homooligomer composed of six subunits. The yeast homolog, CDC48, plays a role in 
spindle pole proliferation; 3) Yeast protein PAS1 essential for peroxisome assembly and the 

1 5 related protein PAS1 from Pichia pastoris; 4) Yeast protein AFG2; 5) Sulfolobus 

acidocaldarius protein SAV and Halobacterium salinarium cdcH, which may be part of a 
transduction pathway connecting light to cell division. 

Proteins containing a single AAA domain include: 1) Escherichia coli and other 
bacteria ftsH (or hflB) protein. FtsH is an ATP-dependent zinc metallopeptidase that 

20 degrades the heat-shock sigma-32 factor, and is an integral membrane protein with a large 
cytoplasmic C-terminal domain that contain both the AAA and the protease domains; 2) 
Yeast protein YME1, a protein important for maintaining the integrity of the mitochondrial 
compartment. YME1 is also a zinc-dependent protease; 3) Yeast protein AFG3 (or YTA10). 
This protein also contains an AAA domain followed by a zinc-dependent protease domain; 

25 4) Subunits from regulatory complex of the 26S proteasome (Hilt et al, Trends Biochem. 
ScL (1996) 27:96), which is involved in the ATP-dependent degradation of ubiquitinated 
proteins, which subunits include: a) Mammalian 4 and homologs in other higher eukaryotes, 
in yeast (gene YTA5) and fission yeast (gene mts2); b) Mammalian 6 (TBP7) and homologs 
in other higher eukaryotes and in yeast (gene YTA2); c) Mammalian subunit 7 (MSS1) and 

30 homologs in other higher eukaryotes and in yeast (gene CIM5 or YTA3); d) Mammalian 

subunit 8 (P45) and homologs in other higher eukaryotes and in yeast (SUG1 or CIM3 or 

TBY1) and fission yeast (gene letl); e) Other probable subunits include human TBP1, which 

influences HIV gene expression by interacting with the virus tat transactivator protein, and 

yeast YTA1 and YTA6; 5) Yeast protein BCS1, a mitochondrial protein essential for the 

91 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 ^ PCT/US98/27610 
expression of the Rieske iron-sulfur protein; 6) Yeast protein MSP1, a protein involved in 

intramitochondrial sorting of proteins; 7) Yeast protein PAS8, and the corresponding 

proteins PASS from Pichia pastoris and PAY4 from Yarrowia lipolytica; 8) Mouse protein 

SKD1 and its fission yeast homolog (SpAC2Gl 1.06); 9) Caenorhabditis elegans meiotic 

5 spindle formation protein mei-1; 10) Yeast protein SAP V 1 1) Yeast protein YTA7; and 12) 

Mycobacterium leprae hypothetical protein A2126A. 

In general, the AAA domains in these proteins act as ATP-dependent protein 

clamps(Confalonieri et al (1995) BioEssays 1 7:639). In addition to the ATP-binding 'A' and 

'B' motifs, which are located in the N-terminal half of this domain, there is a highly 

10 conserved region located in the central part of the domain which was used in the 

development of the signature pattern. The consensus pattern is: [LIVMT]-x-[LIVMT]- 

[LIVMF]-x-[GATMC]-[ST]-[NS]-x(4)-[LIVM]- D-x-A-[LIFA]-x-R. 

e) Basic Region Plus Leucine Zipper Transcription Factors. SEQIDNO:374 
correspond to a polynucleotide encoding a novel member of the family of basic region plus 

15 leucine zipper transcription factors. The bZIP superfamily (Hurst, Protein Prof, (1995) 
2:105; and Ellenberger, Curr. Opin. Struct. Biol (1994) 4:12) of eukaryotic DNA-binding 
transcription factors encompasses proteins that contain a basic region mediating sequence- 
specific DNA-binding followed by a leucine zipper required for dimerization. Members of 
the family include transcription factor AP-1, which binds selectively to enhancer elements in 

20 the cis control regions of S V40 and metallothionein IIA. AP- 1 , also known as c-jun, is the 
cellular homolog of the avian sarcoma virus 17 (ASV17) oncogene v-jun. 

Other members of this protein family include jun-B and jun-D, probable 
transcription factors that are highly similar to jun/AP-1 ; the fos protein, a proto-oncogene 
that forms a non-covalent dimer with c-jun; the fos-related proteins fra-1, and fos B; and 

25 mammalian cAMP response element (CRE) binding proteins CREB, CREM, ATF-l, ATF- 
3, ATF-4, ATF-5, ATF-6 and LRF-1. The consensus pattern for this protein family is: 
[KR]-x( 1 ? 3)-[RKSAQ]-N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK]. 

f) Bromodomain. SEQ ID NO:97 corresponds to a polynucleotide encoding a 
polypeptide having a bromodomain region (Haynes et aL, 1992, Nucleic Acids Res. 

30 20:2693-2603, Tamkun et al., 1992, Cell 68:561-572, and Tamkun, 1995, Curr. Opin. Genet. 

Dev. 5:473-477), which is a conserved region of about 70 amino acids found in the 

following proteins: 1 ) Higher eukaryotes transcription initiation factor TFIID 250 Kd 

subunit (TBP-associated factor p250) (gene CCG1); P250 is associated with the TFIID 

TATA-box binding protein and seems essential for progression of the Gl phase of the cell 
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cycle. 2) Human RING3, a protein of unknown function encoded in the MHC class II locus; 

3) Mammalian CREB-binding protein (CBP), which mediates cAMP-gene regulation by 

binding specifically to phosphorylated CREB protein; 4) Mammalian homologs of brahma, 

including three brahma-like human: SNF2a(hBRM), SNF2b, and BRG1 ; 5) Human BS69, 

5 a protein that binds to adenovirus El A and inhibits El A transactivation; 6) Human peregrin 

(or Br 140). 

The bromodomain is thought to be involved in protein-protein interactions and may 
be important for the assembly or activity of multicomponent complexes involved in 
transcriptional activation. The consensus pattern, which spans a major part of the 
1 0 bromodomain, is: [STANVF]-x(2)-F-x(4)-[D^ 
[LIVMFY]-x(3)-[LIVM]-x(4)-[U^^ 
[FY]. 

g) EF-Hand. SEQ ID NOS:136, 242, and 379 correspond to polynucleotides 
encoding a novel protein in the family of EF-hand proteins. Many calcium-binding proteins 

1 5 belong to the same evolutionary family and share a type of calcium-binding domain known 
as the EF-hand (Kawasaki et al, Protein. Prof. (1995) 2:305-490). This type of domain 
consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical 
domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal 
configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; 

20 these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 
12 provides two oxygens for liganding Ca (bidentate ligand). 

Proteins known to contain EF-hand regions include: Calmodulin (Ca=4, except in 
yeast where Ca=3) ("Ca=" indicates approximate number of EF-hand regions); 
diacylglycerol kinase (EC 2.7.1.107) (DGK) (Ca=2); 2) FAD-dependent glycerol-3- 

25 phosphate dehydrogenase (EC 1 . 1 .99.5) from mammals (Ca=l); guanylate cyclase activating 
protein (GCAP) (Ca=3); MIF related proteins 8 (MRP-8 or CFAG) and 14 (MRP- 14) 
(Ca=2); myosin regulatory light chains (Ca=l); oncomodulin (Ca=2); osteonectin (basement 
membrane protein BM-40) (SPARC); and proteins that contain an "osteonectin" domain 
(QR1, matrix glycoprotein SCI). 

30 The consensus pattern includes the complete EF-hand loop as well as the first residue 

which follows the loop and which seem to always be hydrophobic. 

Consensus pattern: D-x-[DNS].{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}- 
[LIVMC]-[DENQSTAGC]-x(2)-[DE]-[LIVMFYW] 
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h) Eukarvotic Aspartyl Proteases. SEQ ID NO: 308 corresponds to a gene encoding a 
novel eukaryotic aspartyl protease. Aspartyl proteases, known as acid proteases, (EC 
3.4.23.-) are a widely distributed family of proteolytic enzymes (Foltmann B., Essays 
Biochem. (1981) 77:52; Davies D.R., Annu. Rev. Biophys. Chem. (1990) 7P:189; Rao 

5 J.K.M., et ai, Biochemistry (1991) 30:4663) known to exist in vertebrates, fungi, plants, 
retroviruses and some plant viruses. Aspartate proteases of eukaryotes are monomeric 
enzymes which consist of two domains. Each domain contains an active site centered on a 
catalytic aspartyl residue. The two domains most probably evolved from the duplication of 
an ancestral gene encoding a primordial domain. Currently known eukaryotic aspartyl 

10 proteases include: 1) Vertebrate gastric pepsins A and C (also known as gastricsin); 2) 
Vertebrate chymosin (rennin), involved in digestion and used for making cheese; 3) 
Vertebrate lysosomal cathepsins D (EC 3.4.23.5) and E (EC 3.4.23.34); 4) Mammalian renin 
(EC 3.4.23.15) whose function is to generate angiotensin I from angiotensinogen in the 
plasma; 5) Fungal proteases such as aspergillopepsin A (EC 3.4.23.18), candidapepsin (EC 

15 3.4.23.24), mucoropepsin (EC 3.4.23.23) (mucor rennin), endothiapepsin (EC 3.4.23.22), 
polyporopepsin (EC 3.4.23.29), and rhizopuspepsin (EC 3.4.23.21); and 6) Yeast 
saccharopepsin (EC 3.4.23.25) (proteinase A) (gene PEP4). PEP4 is implicated in 
posttranslational regulation of vacuolar hydrolases; 7) Yeast barrierpepsin (EC 3.4.23.35) 
(gene BAR1); a protease that cleaves alpha-factor and thus acts as an antagonist of the 

20 mating pheromone; and 8) Fission yeast sxal which is involved in degrading or processing 
the mating pheromones. 

Most retroviruses and some plant viruses, such as badnaviruses, encode for an 
aspartyl protease which is an homodimer of a chain of about 95 to 125 amino acids. In most 
retroviruses, the protease is encoded as a segment of a polyprotein which is cleaved during 

25 the maturation process of the virus. It is generally part of the pol polyprotein and, more 

rarely, of the gag polyprotein. Because the sequence around the two aspartates of eukaryotic 
aspartyl proteases and around the single active site of the viral proteases is conserved, a 
single signature pattern can be used to identify members of both groups of proteases. The 
consensus pattern is: [LIVMFGAC]-[LIVMTADN]-[LIVFSA]-D-[ST]-G-[STAV]- 

30 [STAPDENQ]- x-[LIVMFSTNC]-x-[LIVMFGTA], where D is the active site residue. 

i) GATA Family of Transcription Factors. SEQ ID NO:213 corresponds to a novel 
member of the GATA family of transcription factors. The GATA family of transcription 
factors are proteins that bind to DNA sites with the consensus sequence (A/T)GATA(A/G), 
found within the regulatory region of a number of genes. Proteins currently known to belong 
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to this family are: 1) GATA-1 (Trainor, CD., et aL, Nature (1990) 343:92) (also known as 
Eryfl, GF-1 or NF-E1), which binds to the GATA region of globin genes and other genes 
expressed in erythroid cells. It is a transcriptional activator which probably serves as a 
general 'switch' factor for erythroid development; 2) GATA-2 (Lee, ME., et al.,J. BioL 
5 Chem. (1991) 26(5:16188), a transcriptional activator which regulates endothelin-1 gene 
expression in endothelial cells; 3) GATA-3 (Ho, I.-C, et aL, EMBO J. (1991) 70:1 187), a 
transcriptional activator which binds to the enhancer of the T-cell receptor alpha and delta 
genes; 4) GATA-4 (Spieth, J., et aL, Mol Cell. Biol. (1991) 77:4651), a transcriptional 
activator expressed in endodermally derived tissues and heart; 5) Drosophila protein pannier 
1 0 (or DGATAa) (gene pnr) which acts as a repressor of the achaete-scute complex (as-c); 6) 
Bombyx mori BCFI (Drevet, J.R., et aL, J. BioL Chem. (1994) 269: 10660), which regulates 
the expression of chorion genes; 7) Caenorhabditis elegans elt-1 and elt-2, transcriptional 
activators of genes containing the GATA region, including vitellogenin genes (Hawkins, 
M.G., et al.,J. BioL Chem. (1995) 270:14666); 8) Ustilago maydis urbsl (Voisard, C.P.O., 
15 et aL, MoL Cell. BioL (1993) 75:7091), a protein involved in the repression of the 
biosynthesis of siderophores; 9) Fission yeast protein GAF2. 

All these transcription factors contain a pair of highly similar 'zinc finger' type 
domains with the consensus sequence C-x2-C-xl7-C-x2-C. Some other proteins contain a 
single zinc finger motif highly related to those of the GATA transcription factors. These 
20 proteins are: 1) Drosophila box A-binding factor (ABF) (also known as protein serpent 

(gene srp)) which may function as a transcriptional activator protein and may play a key role 
in the organogenesis of the fat body; 2) Emericella nidulans are (Arst, H.N., Jr., et aL, 
Trends Genet. (1989) 5:291) a transcriptional activator which mediates nitrogen metabolite 
repression; 3) Neurospora crassa nit-2 (Fu, Y.-H., et aL, MoL Cell. BioL (1990) 70:1056), a 
25 transcriptional activator which turns on the expression of genes coding for enzymes required 
for the use of a variety of secondary nitrogen sources, during conditions of nitrogen 
limitation; 4) Neurospora crassa white collar proteins 1 and 2 (WC-1 and WC-2), which 
control expression of light-regulated genes; 5) Saccharomyces cerevisiae DAL81 (or 
UGA43), a negative nitrogen regulatory protein; 6) Saccharomyces cerevisiae GLN3, a 
30 positive nitrogen regulatory protein; 7) Saccharomyces cerevisiae GAT1 ; 8) Saccharomyces 
cerevisiae GZF3. 

The consensus pattern for the GATA family is: C-x-[DN]-C-x(4 5 5)-[ST]-x(2)-W- 
[HR]-[RK]-x(3)-[GN]-x(3,4)-C-N-[AS]-C, where the four C's are zinc ligands. 
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j) G-Protein Alpha Subunit. SEQ ID NO: 3 67 corresponds to a gene encoding a novel 

polypeptide of the G-protein alpha subunit family. Guanine nucleotide binding proteins (G- 
proteins) are a family of membrane-associated proteins that couple extracellularly-activated 
integral-membrane receptors to intracellular effectors, such as ion channels and enzymes that 
5 vary the concentration of second messenger molecules. G-proteins are composed of 3 

subunits (alpha, beta and gamma) which, in the resting state, associate as a trimer at the inner 
face of the plasma membrane. The alpha subunit has a molecule of guanosine diphosphate 
(GDP) bound to it. Stimulation of the G-protein by an activated receptor leads to its 
exchange for GTP (guanosine triphosphate). This results in the separation of the alpha from 
1 0 the beta and gamma subunits, which always remain tightly associated as a dimer. Both the 
alpha and beta-gamma subunits are then able to interact with effectors, either individually or 
in a cooperative manner. The intrinsic GTPase activity of the alpha subunit hydrolyses the 
bound GTP to GDP. This returns the alpha subunit to its inactive conformation and allows it 
to reassociate with the beta-gamma subunit, thus restoring the system to its resting state. 
1 5 G-protein alpha subunits are 350-400 amino acids in length and have molecular 

weights in the range 40-45 kDa. Seventeen distinct types of alpha subunit have been 
identified in mammals. These fall into 4 main groups on the basis of both sequence 
similarity and function: alpha-s, alpha-q, alpha-i and alpha- 12 (Simon et aL 9 Science (1993) 
252:802). Many alpha subunits are substrates for ADP-ribosylation by cholera or pertussis 
20 toxins. They are often N-terminally acylated, usually with myristate and/or palmitoylate, 
and these fatty acid modifications are probably important for membrane association and 
high- affinity interactions with other proteins. The atomic structure of the alpha subunit of 
the G-protein involved in mammalian vision, transducin, has been elucidated in both GTP- 
and GDB-bound forms, and shows considerable similarity in both primary and tertiary 
25 structure in the nucleotide-binding regions to other guanine nucleotide binding proteins, such 
as p21-ras and EF-Tu. 

k) Phorbol Esters/Diacvl glycerol Binding. SEQ ID NO: 1 88 and 25 1 represent 
polynucleotides encoding a protein belonging to the family including phorbol 
esters/diacylglycerol binding proteins. Diacylglycerol (DAG) is an important second 
30 messenger. Phorbol esters (PE) are analogues of DAG and potent tumor promoters that 

cause a variety of physiological changes when administered to both cells and tissues. DAG 
activates a family of serine/threonine protein kinases, collectively known as protein kinase C 
(PKC) (Azzi et al. 9 Eur. J. Biochem. (1992) 205:547). Phorbol esters can directly stimulate 

PKC. The N-terminal region of PKC, known as CI, has been shown (Ono et al^Proc. Natl 
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Acad Sci. USA (1989) 56:4868) to bind PE and DAG in a phospholipid and zinc-dependent 

fashion. The CI region contains one or two copies (depending on the isozyme of PKC) of a 

cysteine-rich domain about 50 amino-acid residues long and essential for DAG/PE-binding. 

Such a domain has also been found in, for example, the following proteins. 
5 ( 1 ) Diacylglycerol kinase (EC 2.7.1.107) (DGK) (Sakane et al , Nature ( 1 990) 

344:345), the enzyme that converts DAG into phosphatidate. It contains two copies of the 

DAG/PE-binding domain in its N-terminal section. At least five different forms of DGK are 

known in mammals; and 

(2) N-chimaerin, a brain specific protein which shows sequence similarities with the 
1 0 BCR protein at its C-terminal part and contains a single copy of the DAG/PE-binding 

domain at its N-terminal part. It has been shown (Ahmed et al. 9 Biochem. J. (1990) 272:767, 

and Ahmed et al.Biochem. J. (1991) 280:233) to be able to bind phorbol esters. 

The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are 

probably the six cysteines and two histidines that are conserved in this domain. The 
1 5 signature pattern completely spans the DAG/PE domain. The consensus pattern is: H-x- 

[LIVMFYW]-x(8Jl)-C-x(2)-^ 

C. All the C and H are probably involved in binding zinc. 

1) Protein Kinase. SEQ ID NOS:202, 315, 367, and 397 represent polynucleotides 
encoding protein kinases. Protein kinases catalyze phosphorylation of proteins in a variety of 

20 pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks S.K., et al 9 
FASEBJ. (1995) 9:576; Hunter T., Meth. Enzymol (1991) 200:3; Hanks S.K., et al, Meth 
Enzymol (1991) 200:38; Hanks S.K., Curr. Opin. Struct. Biol (1991) 7:369; Hanks S.K., et 
al, Science (1988) 241:42) are enzymes that belong to a very extensive family of proteins 
which share a conserved catalytic core common to both serine/threonine and tyrosine protein 

25 kinases. There are a number of conserved regions in the catalytic domain of protein kinases. 
Two of the conserved regions are the basis for the signature pattern in the protein kinase 
profile. The first region, which is located in the N-terminal extremity of the catalytic 
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has 
been shown to be involved in ATP binding. The second region, which is located in the 

30 central part of the catalytic domain, contains a conserved aspartic acid residue which is 
important for the catalytic activity of the enzyme (Knighton D.R., et al, Science (1991) 
255:407). The protein kinase profile includes two signature patterns for this second region: 
one specific for serine/threonine kinases and the other for tyrosine kinases. A third profile is 
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based on the alignment in (Hanks S.K., et al % FASEB J. (1995) 9:576) and covers the entire 
catalytic domain. The consensus patterns are as follows: 

1) Consensus pattern: [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}« 
[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-[LIVMFYWCSTAR]-[AIVP]- 
5 [LIVMFAGCKR]-K, where K binds ATP. The majority of known protein kinases are 
detected by this pattern. Proteins kinases that are not detected by this consensus include 
viral kinases, which are quite divergent in this region and are completely missed by this 
pattern. 



10 [LIVMFYCT](3), where D is an active site residue. This consensus sequence identifies most 
serine/threonine-specific protein kinases with only 10 exceptions. Half of the exceptions are 
viral kinases, while the other exceptions include Epstein-Barr virus BGLF4 and Drosophila 
ninaC, which have Ser and Arg, respectively, instead of the conserved Lys. These latter two 
protein kinases are detected by the tyrosine kinase specific pattern described below. 

1 5 3) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-[RSTAC]-x(2)-N- 

[LIVMFYC], where D is an active site residue. All tyrosine-specific protein kinases are 
detected by this consensus pattern, with the exception of human ERBB3 and mouse blk. 
This pattern also detects most bacterial aminoglycoside phosphotransferases (Benner S., 
Nature (1987) 329:21; Kirby R., J. Mol Evol (1992) 50:489) and herpesviruses ganciclovir 

20 kinases (Littler E., et aL, Nature ( 1 992) 358: 1 60), which are structurally and evolutionary 
related to protein kinases. 

The protein kinase profile also detects receptor guanylate cyclases and 2-5A- 
dependent ribonucleases. Sequence similarities between these two families and the 
eukaryotic protein kinase family have been noticed previously. The profile also detects 

25 Arabidopsis thaliana kinase-like protein TMKL1 which seems to have lost its catalytic 
activity. 

If a protein analyzed includes the two of the above protein kinase signatures, the 
probability of it being a protein kinase is close to 100%. Eukaryotic-type protein kinases 
have also been found in prokaryotes such as Myxococcus xanthus (Munoz-Dorado J., et aL 9 
30 Cell (1991) 67:995) and Yersinia pseudotuberculosis. The patterns shown above has been 
updated since their publication in (Bairoch A., et al, Nature (1988) 331:22). 

m) Protein Phosphatase 2C. SEQ ID NO:256 corresponds to a polynucleotide 
encoding a novel protein phosphatase 2C (PP2C), which is one of the four major classes of 
mammalian serine/threonine specific protein phosphatases. PP2C (Wenk et ai 9 FEBS Lett. 



2) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N- 
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(1992) 297:135) is a monomeric enzyme of about 42 Kd which shows broad substrate 

specificity and is dependent on divalent cations (mainly manganese and magnesium) for its 

activity. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. 

n) Protein Tyrosine Phosphatase. SEQ ID NO:382 represents a polynucleotide 

5 encoding a protein tyrosine kinase. Tyrosine specific protein phosphatases (EC 3.1 .3.48) 

(PTPase) (Fischer et aL, Science (1991) 253:401 ; Charbonneau et al.Annu. Rev. Cell Biol. 

(1992) 8:463; Trowbridge, J. Biol Chem. (1991) 2(56:23517; Tonks et aL, Trends Biochem. 

Set (1989) 74:497; and Hunter, Cell (1989) 55:1013) catalyze the removal of a phosphate 

group attached to a tyrosine residue. These enzymes are very important in the control of cell 

10 growth, proliferation, differentiation and transformation. Multiple forms of PTPase have 
been characterized and can be classified into two categories: soluble PTPases and 
transmembrane receptor proteins that contain PTPase domain(s). 

Soluble PTPases include PTPN3 (HI) and PTPN4 (MEG), enzymes that contain an 
N-terminal band 4.1 -like domain and could act at junctions between the membrane and 

15 cytoskeleton; PTPN6 (PTP-1C; HCP; SHP) and PTPN1 1 (PTP-2C; SH-PTP3; Syp), 
enzymes that contain two copies of the SH2 domain at its N-terminal extremity. 

Dual specificity PTPases include DUSP1 (PTPN10; MAP kinase phosphatase- 1 ; 
MKP-1) which dephosphorylates MAP kinase on both Thr-183 and Tyr-185; and DUSP2 
(PAC-1), a nuclear enzyme that dephosphorylates MAP kinases ERK1 and ERK2 on both 

20 Thr and Tyr residues. 

Structurally, all known receptor PTPases are made up of a variable length 
extracellular domain, followed by a transmembrane region and a C-terminal catalytic 
cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III) 
repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains 

25 in their extracellular region. The cytoplasmic region generally contains two copies of the 

PTPAse domain. The first seems to have enzymatic activity, while the second is inactive but 
seems to affect substrate specificity of the first. In these domains, the catalytic cysteine is 
generally conserved but some other, presumably important, residues are not. 

PTPase domains consist of about 300 amino acids. There are two conserved 

30 cysteines and the second one has been shown to be absolutely required for activity. 

Furthermore, a number of conserved residues in its immediate vicinity have also been shown 
to be important. The consensus pattern for PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-[STC]- 
[STAGP]-x-[LIVMFY]; C is the active site residue. 
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o) SH3 Domain. SEQ ID NO:306 and 386 represent polynucleotides encoding SH3 

domain proteins. The Src homology 3 (SH3) domain is a small protein domain of about 60 
amino acid residues first identified as a conserved sequence in the non-catalytic part of 
several cytoplasmic protein tyrosine kinases (e.g. Src, Abl, Lck) (Mayer et at, Nature (1988) 
5 332:272). The domain has also been found in a variety of intracellular or membrane- 
associated proteins (Musacchio et al, FEBS Lett. (1992) 507:55; Pawson et al., Curr. Biol 
(1993) 5:434; Mayer et al, Trends Cell Biol (1993) 5:8; and Pawson et al, Nature (1995) 
575:573). 

The SH3 domain has a characteristic fold that consists of five or six beta-strands 
1 0 arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain 

short helices (Kuriyan et al, Curr. Opin. Struct. Biol (1993) 5:828). It is believed that SH3 
domain-containing proteins mediate assembly of specific protein complexes via binding to 
proline-rich peptides (Morton et al, Curr. Biol (1994) 4:615). In general, SH3 domains are 
found as single copies in a given protein, but there is a significant number of proteins with 

1 5 two SH3 domains and a few with 3 or 4 copies. 

SH3 domains have been identified in, for example, protein tyrosine kinases, such as 
the Src, Abl, Bkt, Csk and ZAP70 families of kinases; mammalian phosphatidylinositol- 
specific phospholipase C-gamma-1 and -2; mammalian phosphatidyl inositol 3-kinase 
regulatory p85 subunit; mammalian Ras GTPase-activating protein (GAP); mammalian Vav 

20 oncoprotein, a guanine nucleotide exchange factor of the CDC24 family; Drosophila 
lethal(l)discs large- 1 tumor suppressor protein (gene Dlgl); mammalian tight junction 
protein ZO-1; vertebrate erythrocyte membrane protein p55; Caenorhabditis elegans protein 
lin-2; rat protein CASK; and mammalian synaptic proteins SAP90/PSD-95, CHAPS YN- 
1 10/PSD-93, SAP97/DLG1 and SAP102. Novel SH3-domain containing polypeptides will 

25 facilitate elucidation of the role of such proteins in important biological pathways, such as 
ras activation. 

p) Trypsin. SEQ ID NO: 169 corresponds to a novel serine protease of the trypsin 

family. The catalytic activity of the serine proteases from the trypsin family is provided by a 

charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which 

30 itself is hydrogen-bonded to a serine. The sequences in the vicinity of the active site serine 

and histidine residues are well conserved in this family of proteases (Brenner S., Nature 

(1 988) 554:528). Proteases known to belong to the trypsin family include: 1 ) Acrosin; 2) 

Blood coagulation factors VII, IX, X, XI and XII, thrombin, plasminogen, and protein C; 3) 

Cathepsin G; 4) Chymotrypsins; 5) Complement components Clr, Cls, C2, and complement 
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factors B, D and I; 6) Complement-activating component of RA-reactive factor; 7) Cytotoxic 
cell proteases (granzymes A to H); 8) Duodenase I; 9) Elastases 1, 2, 3 A, 3B (protease E), 
leukocyte (medullasin).; 10) Enterokinase (EC 3.4.21.9) (enteropeptidase); 1 1 ) Hepatocyte 
growth factor activator; 12) Hepsin; 13) Glandular (tissue) kallikreins (including EGF- 
5 binding protein types A, B ? and C, NGF-gamma chain, gamma-renin, prostate specific 

antigen (PSA) and tonin); 14) Plasma kallikrein; 15) Mast cell proteases (MCP) 1 (chymase) 
to 8; 16) Myeloblastin (proteinase 3) (Wegener's autoantigen); 17) Plasminogen activators 
(urokinase-type, and tissue-type); 18) Trypsins I, II, III, and IV; 19) Tryptases; 20) Snake 
venom proteases such as ancrod, batroxobin, cerastobin, flavoxobin, and protein C activator; 

10 21) Collagenase from common cattle grub and collagenolytic protease from Atlantic sand 
fiddler crab; 22) Apolipoprotein(a); 23) Blood fluke cercarial protease; 24) Drosophila 
trypsin like proteases: alpha, easter, snake-locus; 25) Drosophila protease stubble (gene sb); 
and 26) Major mite fecal allergen Der p III. All the above proteins belong to family S 1 in 
the classification of peptidases (Rawlings N.D., et al, Meth Enzymol (1994) 244A9; 

15 http://www.expasv.ch/cgi-bin/lists7peptidas.txt ) and originate from eukaryotic species. It 
should be noted that bacterial proteases that belong to family S2A are similar enough in the 
regions of the active site residues that they can be picked up by the same patterns. 

The consensus patterns for this trypsin protein family are: 1) [LIVM]-[ST]-A- 
[STAG]-H-C, where H is the active site residue. All sequences known to belong to this class 

20 detected by the pattern, except for complement components Clr and Cls, pig plasminogen, 
bovine protein C, rodent urokinase, ancrod, gyroxin and two insect trypsins; 2) 
[DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]- [LIVMFYWH]- 
[LIVMFYSTANQH], where S is the active site residue. All sequences known to belong to 
this family are detected by the above consensus sequences, except for 1 8 different proteases 

25 which have lost the first conserved glycine. If a protein includes both the serine and the 

histidine active site signatures, the probability of it being a trypsin family serine protease is 



q) WD Domain. G-Beta Repeats. SEQ ID NOS:188 and 335 represent novel 
members of the WD domain/G-beta repeat family. Beta-transducin (G-beta) is one of the 
30 three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G 
proteins) which act as intermediaries in the transduction of signals generated by 
transmembrane receptors (Gilman,^4w7w. Rev. Biochem. (1987) 56:615). The alpha subunit 
binds to and hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but 
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they seem to be required for the replacement of GDP by GTP as well as for membrane 

anchoring and receptor recognition. 

In higher eukaryotes, G-beta exists as a small multigene family of highly conserved 

proteins of about 340 amino acid residues. Structurally, G-beta consists of eight tandem 

5 repeats of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is 

sometimes called a WD-40 repeat). Such a repetitive segment has been shown to exist in a 

number of other proteins including: human LIS1, a neuronal protein involved in type-1 

lissencephaly; and mammalian coatomer beta 1 subunit (beta'-COP), a component of a 

cytosolic protein complex that reversibly associates with Golgi membranes to form vesicles 

1 0 that mediate biosy nthetic protein transport. 

The consensus pattern for the WD domain/G-Beta repeat family is: [LIVMSTAC]- 

[LIVMFYWSTAGC]^LIMSTAG]4LIVMSTAGC]<2)-[DN]-x(2)4LIVMWSTAC]-x- 

[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN]. 

r) wnt Family of Developmental Signaling Proteins. SEQ ID NO: 23, 291 , 324, 330, 

15 341, and 353 correspond to novel members of the wnt family of developmental signaling 

proteins. Wnt-1 (previously known as int-1), the seminal member of this family, (Nusse R., 

Trends Genet. (1988) 4:291) is a proto-oncogene induced by the integration of the mouse 

mammary tumor virus. It is thought to play a role in intercellular communication and seems 

to be a signalling molecule important in the development of the central nervous system 

20 (CNS). The sequence of wnt-1 is highly conserved in mammals, fish, and amphibians. Wnt- 

1 was found to be a member of a large family of related proteins (Nusse R., et ol., Cell 
(1992) 69:1073; McMahon A.P., Trends Genet. (1992) 8:1; Moon R.T., BioEssays (1993) 
75:91) that are all thought to be developmental regulators. These proteins are known as wnt- 

2 (also known as irp), wnt-3, -3 A, -4, -5A, -5B, -6, -7A, -7B, -8, -8B, -9 and -10. At least 
25 four members of this family are present in Drosophila; one of them, wingless (wg), is 

implicated in segmentation polarity. All these proteins share the following features 

characteristics of secretory proteins: a signal peptide, several potential N-glycosylation sites 

and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins 

seem to adhere to the plasma membrane of the secreting cells and are therefore likely to 

30 signal over only few cell diameters. The consensus pattern, which is based upon a highly 

conserved region including three cysteines, is as follows: C-K-C-H-G-[LIVMT]-S-G-x-C. 

All sequences known to belong to this family are detected by the provided consensus pattern. 

s) Ww/rsp5/WWP Domain-Containing Proteins. SEQ ID NOS:l 88, 379 , and 395 

represent polynucleotides encoding a polypeptide in the family of WW/rsp5/WWP domain- 
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containing proteins. The WW domain (Bork et al, Trends Biochem. Sci. (1994) 79:531; 

Andre et aL 9 Biochem. Biophys. Res. Commun. (1994) 205:1201; Hofmann et aL, FEBS Lett. 

(1995) 555:153; and Sudol et al. 9 FEBS Lett. (1995) 369:67), also known as rsp5 or WWP), 

was originally discovered as a short conserved region in a number of unrelated proteins, 

5 among them dystrophin, the gene responsible for Duchenne muscular dystrophy. The 

domain, which spans about 35 residues, is repeated up to 4 times in some proteins. It has 

been shown (Chen et al.Proc. Natl Acad Set USA (1995) 92:7819) to bind proteins with 

particular proline-motifs, [AP]-P-P-[AP]-Y, and thus resembles somewhat SH3 domains. It 

appears to contain beta-strands grouped around four conserved aromatic positions, generally 

10 Trp. The name WW or WWP derives from the presence of these Trp as well as that of a 

conserved Pro. It is frequently associated with other domains typical for proteins in signal 

transduction processes. 

Proteins containing the WW domain include: 

1. Dystrophin, a multidomain cytoskeletal protein. Its longest alternatively 

1 5 spliced form consists of an N-terminal actin-binding domain, followed by 24 spectrin-like 
repeats, a cysteine-rich calcium-binding domain and a C-terminal globular domain. 
Dystrophins form tetramers and is thought to have multiple functions including involvement 
in membrane stability, transduction of contractile forces to the extracellular environment and 
organization of membrane specialization. Mutations in the dystrophin gene lead to muscular 

20 dystrophy of Duchenne or Becker type. Dystrophin contains one WW domain C-terminal of 
the spectrin-repeats. 

2. Vertebrate YAP protein, which is a substrate of an unknown serine kinase. It 
binds to the SH3 domain of the Yes oncoprotein via a proline-rich region. This protein 
appears in alternatively spliced isoforms, containing either one or two WW domains. 

25 3. IQGAP, which is a human GTPase activating protein acting on ras. It 

contains an N-terminal domain similar to fly muscle mp20 protein and a C-terminal ras 

GTPase activator domain. 

For the sensitive detection of WW domains, the profile spans the whole homology 

region as well as a pattern. The consensus for this family is: W-x(9,l 1)-[VFY]-[FYW]- 

30 x(6,7)-[GSTNE]-[GSTQCR]-[FYW]-x(2)-P. 

t) Zinc Finger. C2H2 Type. SEQ ID NO:61, 306, and 386 correspond to 

polynucleotides encoding novel members of the of the C2H2 type zinc finger protein family. 

Zinc finger domains (Klug et al. 9 Trends Biochem. Sci. (1987) 72:464; Evans et al, Cell 

(1988) 52:1; Payre et al, FEBS Lett (1988) 254:245; Miller et al, EMBOJ. (1985) 4:1609; 
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and Berg, Proc. Natl. Acad. Sci. USA (1988) 55:99) are nucleic acid-binding protein 
structures first identified in the Xenopus transcription factor TFIIIA. These domains have 
since been found in numerous nucleic acid-binding proteins. A zinc finger domain is 
composed of 25 to 30 amino acid residues. Two cysteine or histidine residues are positioned 
5 at both extremities of the domain, which are involved in the tetrahedral coordination of a 
zinc atom. It has been proposed that such a domain interacts with about five nucleotides. 

Many classes of zinc fingers are characterized according to the number and positions 
of the histidine and cysteine residues involved in the zinc atom coordination. In the first 
class to be characterized, called C2H2, the first pair of zinc coordinating residues are 
10 cysteines, while the second pair are histidines. A number of experimental reports have 

demonstrated the zinc-dependent DNA or RNA binding property of some members of this 
class. 

Mammalian proteins having a C2H2 zipper include (number in parenthesis indicates 
number of zinc finger regions in the protein): basonuclin (6), BCL-6/LAZ-3 (6), erythroid 

15 krueppel-like transcription factor (3), transcription factors Spl (3), Sp2 (3), Sp3 (3) and 
Sp(4) 3, transcriptional repressor YY1 (4), Wilms' tumor protein (4), EGRl/Krox24 (3), 
EGR2/Krox20 (3), EGR3/Pilot (3), EGR4/AT133 (4), Evi-1 (10), GLI1 (5), GLI2 (4+), 
GLI3 (3+), HIV-EP1/ZNF40 (4), HIV-EP2 (2), KR1 (9+), KR2 (9), KR3 (15+), KR4 (14+), 
KR5 (1 1+), HF.12 (6+), REX-1 (4), ZfX (13), ZfY (13), Zfp-35 (18), ZNF7 (15), ZNF8 (7), 

20 ZNF35 (10), ZNF42/MZF-1 (13), ZNF43 (22), ZNF46/Kup (2), ZNF76 (7), ZNF91 (36), 
ZNF133 (3). 

In addition to the conserved zinc ligand residues, it has been shown that a number of 
other positions are also important for the structural integrity of the C2H2 zinc fingers. 
(Rosenfeld et al.,J. Biomol Struct. Dyn. (1993) J 7:557) The best conserved position is 
25 found four residues after the second cysteine; it is generally an aromatic or aliphatic residue. 

The consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]- 
x(8)-H-x(3,5)-H. The two C's and two H's are zinc ligands. 

u) Zinc Finger, CCHC Class. SEQ ID NO:322 corresponds to a polynucleotide 
encoding a novel member of the zinc finger CCHC family. The CCHC zinc finger protein 
30 family to date has been mostly composed of retroviral gag proteins (nucleocapsid). The 

prototype structure of this family is from HIV. The family also contains members involved 
in eukaryotic gene regulation, such as C. elegans GLH-1. The consensus sequence of this 
family is based upon the common structure of an 18-residue zinc finger. 
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v) Zinc-Binding Metalioprotease Domain. SEQ ID NO:306 and 395 represent 

polynucleotides encoding novel members of the zinc-binding metalioprotease domain 

protein family. The majority of zinc-dependent metallopeptidases (with the notable 

exception of the carboxypeptidases) share a common pattern of primary structure (Jongeneel 

5 et al , FEBS Lett ( 1 989) 242:2 1 1 ; Murphy et al , FEBS Lett. ( 1 99 1 ) 289:4; and Bode et al , 

Zoology (1996) 99:237) in the part of their sequence involved in the binding of zinc, and can 

be grouped together as a superfamily, known as the metzincins, on the basis of this sequence 

similarity. Examples of these proteins include: 1) Angiotensin-converting enzyme (EC 

3.4.15.1) (dipeptidyl carboxypeptidase I) (ACE), the enzyme responsible for hydrolyzing 

10 angiotensin I to angiotensin II. 2) Mammalian extracellular matrix metalloproteinases 
(known as matrixins) (Woessner, FASEBJ. (1991) 5:2145): MMP-1 (EC 3.4.24.7) 
(interstitial collagenase), MMP-2 (EC 3.4.24.24) (72 Kd gelatinase), MMP-9 (EC 3.4.24.35) 
(92 Kd gelatinase), MMP-7 (EC 3.4.24.23) (matrylisin), MMP-8 (EC 3.4.24.34) (neutrophil 
collagenase), MMP-3 (EC 3.4.24.17) (stromelysin-1), MMP-10 (EC 3.4.24.22) 

15 (stromelysin-2), and MMP-1 1 (stromelysin-3), MMP-12 (EC 3.4.24.65) (macrophage 
metalloelastase). 3) Endothelin-converting enzyme 1 (EC 3.4.24.71) (ECE-1), which 
processes the precursor of endothelin to release the active peptide. 

A signature pattern which includes the two histidine and the glutamic acid residues is 
sufficient to detect this superfamily of proteins, having the consensus pattern: [GSTALIVN]- 

20 x(2)-H-E-[LIVMFYW]-{DEHRKP}-H-x-[LIVMFYWGSPQ]. The two H's are zinc 
ligands, and E is the active site residue. 

Example 4: Differential Expression of Polynucleotides of the Invention : Description of 
Libraries and Detection of Differential Expression 

25 The relative expression levels of the polynucleotides of the invention was assessed in 

several libraries prepared from various sources, including cell lines and patient tissue 
samples. Table 4 provides a summary of these libraries, including the shortened library 
name (used hereafter), the mRNA source used to prepared the cDNA library, the "nickname" 
of the library that is used in the tables below (in quotes), and the approximate number of 

30 clones in the library. 
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Table 4 Description of cDNA Libraries 



Li bra ry 
(lib #) 


Description 


Number of 
Clones in this 
Clustering 


1 


Kml2L4 

Human Colon Cell Line, High Metastatic Potential (derived 
fromKml2C) 


307133 


2 


Kml2C 

Human Colon Cell Line, Low Metastatic Potential 
low v^-oion 


284755 


3 


MDA-MB-231 

Human Breast Cancer Cell Line, High Metastatic Potential; 
micro- ijieiasiases in lung 
"High Breast" 


326937 


4 


MCF7 

Human Breast Cancer Cell, Non Metastatic 
Tow Breast" 


318979 


o 


1VI V- jZz 

Human Lung Cancer Cell Line, High Metastatic Potential 
"High Lung" 


223620 


O 

y 


Human Lung Cancer Cell Line, Low Metastatic Potential 
"Low Lung" 


312503 




Human microvascular endothelial cells (HMEC) — Untreated 
PCR (OligodT) cDNA library 


41938 


13 


Human microvascular endothelial cells (HMEC) - bFGF 
treated 

PCR (OligodT) cDNA library 


42100 


14 


Human microvascular endothelial cells (HMEC) - VEGF 
PCR (OligodT) cDNA library 


42825 


15 


Normal Colon - UC#2 Patient 
r Li\ ^vjjigoai ) cujn a. norary 
"Normal Colon Tumor Tissue" 


34285 


16 


Colon Tumor - UC#2 Patient 
i v^in. ^v_jng(jui ) cl/in/\ iiDrary 
"Normal Colon Tumor Tissue" 


35625 


17 


Liver Metastasis from Colon Tumor of UC#2 Patient 
"High Colon Metastasis Tissue" 


36984 


18 


Normal Colon - UC#3 Patient 
PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 


^fO 1 f\ 

JUZ. 1 o 


19 


Colon Tumor- UC#3 Patient 
PCR (OligodT) cDNA library 
"High Colon Tumor Tissue" 


41388 


20 


Liver Metastasis from Colon Tumor of UC#3 Patient 
PCR (OligodT) cDNA library 
"High Colon Metastasis Tissue" 


30956 
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The KM12L4 and KM12C cell lines are described in Example 1 above. The MDA- 
MB-231 cell line was originally isolated from pleural effusions (Cailleau, J. Natl Cancer. 
Inst. (1974) 53:661), is of high metastatic potential, and forms poorly differentiated 
adenocarcinoma grade II in nude mice consistent with breast carcinoma. The MCF7 cell line 
5 was derived from a pleural effusion of a breast adenocarcinoma and is non-metastatic. The 
MV-522 cell line is derived from a human lung carcinoma and is of high metastatic 
potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the MV- 
522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as 
models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al, 

10 Cancer Res. (1979) 39:870 (MDA-MB-231 and MCF-7 ); Gastpar et al , J Med Chem (1998) 
4 1 :4965 (MDA-MB-23 1 and MCF-7); Ranson et al , Br J Cancer ( 1 998) 77: 1 586 (MDA- 
MB-231 and MCF-7); Kuang et al, Nucleic Acids Res (1998) 25:1 1 16 (MDA-MB-231 and 
MCF-7); Varki et al , Int J Cancer (1987) 40:46 (UCP-3); Varki et al, Tumour Biol (1990) 
77:327; (MV-522 and UCP-3); Varki et al, Anticancer Res. (1990) 70:637; (MV-522); 

1 5 Kelner et al, Anticancer Res ( 1 995) 75:867 (MV-522); and Zhang et al , Anticancer Drugs 
(1997) 5:696 (MV522)). The samples of libraries 15-20 are derived from two different 
patients (UC#2, and UC#3). 

Each of the libraries is composed of a collection of cDNA clones that in turn are 
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate 

20 the analysis of the millions of sequences in each library, the sequences were assigned to 
clusters. The concept of "cluster of clones" is derived from a sorting/grouping of cDNA 
clones based on their hybridization pattern to a panel of roughly 300 7bp oligonucleotide 
probes (see Drmanac et al, Genomics (1996) 37(1):29). Random cDNA clones from a 
tissue library are hybridized at moderate stringency to 300 7bp oligonucleotides. Each 

25 oligonucleotide has some measure of specific hybridization to that specific clone. The 
combination of 300 of these measures of hybridization for 300 probes equals the 
"hybridization signature" for a specific clone. Clones with similar sequence will have 
similar hybridization signatures. By developing a sorting/grouping algorithm to analyze 
these signatures, groups of clones in a library can be identified and brought together 

30 computationally. These groups of clones are termed "clusters". Depending on the 

stringency of the selection in the algorithm (similar to the stringency of hybridization in a 

classic library cDNA screening protocol), the "purity" of each cluster can be controlled. For 

example, artifacts of clustering may occur in computational clustering just as artifacts can 

occur in "wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at even the 
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highest stringency. The stringency used in the implementation of cluster herein provides 
groups of clones that are in general from the same cDNA or closely related cDNAs. Closely 
related clones can be a result of different length clones of the same cDNA, closely related 
clones from highly related gene families, or splice variants of the same cDNA. 
5 Differential expression for a selected cluster was assessed by first determining the 

number of cDNA clones corresponding to the selected cluster in the first library (Clones in 
1 st ), and the determining the number of cDNA clones corresponding to the selected cluster in 
the second library (Clones in 2 nd ). Differential expression of the selected cluster in the first 
library relative to the second library is expressed as a "ratio" of percent expression between 
10 the two libraries. In general, the "ratio" is calculated by: 1) calculating the percent 
expression of the selected cluster in the first library by dividing the number of clones 
corresponding to a selected cluster in the first library by the total number of clones analyzed 
from the first library; 2) calculating the percent expression of the selected cluster in the 
second library by dividing the number of clones corresponding to a selected cluster in a 
15 second library by the total number of clones analyzed from the second library; 3) dividing 
the calculated percent expression from the first library by the calculated percent expression 
from the second library. If the "number of clones" corresponding to a selected cluster in a 
library is zero, the value is set at 1 to aid in calculation. The formula used in calculating the 
ratio takes into account the "depth" of each of the libraries being compared, i.e., the total 
20 number of clones analyzed in each library. 

In general, a polynucleotide is said to be significantly differentially expressed 
between two samples when the ratio value is greater than at least about 2, preferably greater 
than at least about 3, more preferably greater than at least about 5 , where the ratio value is 
calculated using the method described above. The significance of differential expression is 
25 determined using a z score test (Zar, Biostatistical Analysis . Prentice Hall, Inc., USA, 
"Differences between Proportions," pp 296-298 (1974). 

Tables 5 to 7 (inserted before the claims) show the number of clones in each of the 
above libraries that were analyzed for differential expression. Examples of differentially 
expressed polynucleotides of particular interest are described in more detail below. 

30 

Example 5: Polynucleotides Differentially Expressed in High Metastatic Potential Breast 
Cancer Cells Versus Low Metastatic Breast Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high metastatic potential breast cancer tissue and low 
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metastatic breast cancer cells. Expression of these sequences in breast cancer can be 
valuable in determining diagnostic, prognostic and/or treatment information. For example, 
sequences that are highly expressed in the high metastatic potential cells can be indicative of 
increased expression of genes or regulatory sequences involved in the metastatic process. A 
5 patient sample displaying an increased level of one or more of these polynucleotides may 
thus warrant more aggressive treatment. In another example, sequences that display higher 
expression in the low metastatic potential cells can be associated with genes or regulatory 
sequences that inhibit metastasis, and thus the expression of these polynucleotides in a 
sample may warrant a more positive prognosis than the gross pathology would suggest. 
10 The differential expression of these polynucleotides can be used as a diagnostic 

marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 



1 5 expression between high metastatic potential breast cancer cells and low metastatic potential 
breast cancer cells. 



The following table summarizes identified polynucleotides with differential 
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Table 8. Differentially expressed polynucleotides: High metastatic potential breast 

cancer vs. low metastatic breast cancer cells 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


l 5 ' 
Library 


2 nd 
Library 




9 


High Breast > Low Breast (Lib3 > Lib4) 


2623 


31 


4 


7.561356 


42 


High Breast > Low Breast (Lib3 > Lib4) 


307 


196 


75 


2.549721 


52 


High Breast > Low Breast (Lib3 > Lib4) 


19 


1364 


525 


2.534854 


62 


High Breast > Low Breast (Lib3 > Lib4) 


2623 


31 


4 


7.561356 


65 


High Breast > Low Breast (Lib3 > Lib4) 


5749 


9 


0 


8.780930 


66 


High Breast > Low Breast (Lib3 > Lib4) 


6455 


6 


0 


5.853953 


68 


High Breast > Low Breast (Lib3 > Lib4) 


6455 


6 


0 


5.853953 


114 


High Breast > Low Breast (Lib3 > Lib4) 


2030 


32 


4 


7.805271 


123 


High Breast > Low Breast (Lib3 > Lib4) 


3389 


13 


2 


6.341782 


144 


High Breast > Low Breast (Lib3 > Lib4) 


4623 


12 


2 


5.853953 


172 


High Breast > Low Breast (Lib3 > Lib4) 


102 


278 


1 16 


2.338217 


178 


High Breast > Low Breast (Lib3 > Lib4) 


3681 


10 


1 


9.756589 


214 


High Breast > Low Breast (Lib3 > Lib4) 


3900 


8 


1 


7.805271 


219 


High Breast > Low Breast (Lib3 > Lib4) 


3389 


13 


2 


6.341782 


223 


High Breast > Low Breast (Lib3 > Lib4) 


1399 


19 


7 


2.648217 


258 


High Breast > Low Breast (Lib3 > Lib4) 


4837 


10 


0 


9.756589 


317 


High Breast > Low Breast (Lib3 > Lib4) 


1577 


25 


3 


8.130490 


* 379 


High Breast > Low Breast (Lib3 > Lib4) 


260 


27 


2 


13.17139 


4 


Low Breast > High Breast (Lib4 > Lib3) 


3706 


22 


4 


5.637215 


39 


Low Breast > High Breast (Lib4 > Lib3) 


4016 


6 


0 


6.149690 


74 


Low Breast > High Breast (Lib4 > Lib3) 


6268 


18 


3 


6.149690 


81 


Low Breast > High Breast (Lib4 > Lib3) 


40392 


8 


1 


8.199586 


130 


Low Breast > High Breast (Lib4 > Lib3) 


13183 


7 


0 


7.174638 


157 


Low Breast > High Breast (Lib4 > Lib3) 


5417 


9 


0 


9.224535 


162 


Low Breast > High Breast (Lib4 > Lib3) 


9685 


7 


0 


7.174638 


183 


Low Breast > High Breast (Lib4 > Lib3) 


7337 


16 


3 


5.466391 


202 


Low Breast > High Breast (Lib4 > Lib3) 


6124 


9 


1 


9.224535 


298 


Low Breast > High Breast (Lib4 > Lib3) 


1037 


22 


4 


5.637215 


338 


Low Breast > High Breast (Lib4 > Lib3) 


689 


36 


17 


2.170478 


384 


Low Breast > High Breast (Lib4 > Lib3) 


697 


72 


30 


2.459876 


386 


Low Breast > High Breast (Lib4 > Lib3) 


4568 


9 


0 


9.224535 


388 


Low Breast > High Breast (Lib4 > Lib3) 


5622 


13 


2 


6.662164 



5 Example 6: Polynucleotides Differentially Expressed in High Metastatic Potential Lung 
Cancer Cells Versus Low Metastatic Lung Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high metastatic potential lung cancer tissue and low 
metastatic lung cancer cells. Expression of these sequences in lung cancer tissue can be 
1 0 valuable in determining diagnostic, prognostic and/or treatment information. For example, 
sequences that are highly expressed in the high metastatic potential cells are associated can 
be indicative of increased expression of genes or regulatory sequences involved in the 
metastatic process. A patient sample displaying an increased level of one or more of these 
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polynucleotides may thus warrant more aggressive treatment. In another example, 

sequences that display higher expression in the low metastatic potential cells can be 

associated with genes or regulatory sequences that inhibit metastasis, and thus the expression 

of these polynucleotides in a sample may warrant a more positive prognosis than the gross 

5 pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 

marker, a prognostic marker, for risk assessment, patient treatment and the like. These 

polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

10 The following table summarizes identified polynucleotides with differential 

expression between high metastatic potential lung cancer cells and low metastatic potential 

lung cancer cells: 



15 



Table 9 Differentially expressed polynucleotides: High metastatic potential lung 





cancer vs. low metastatic lung cancer cells 








SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ED 


1 st 
Library 


Library 




400 


High Lung > Low Lung (Lib8 > Lib 9) 


14929 


23 


16 


2.008868 


9 


High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


34 


High Lung > Low Lung (Lib8 > Lib9) 


5832 


5 


0 


6.987366 


42 


High Lung > Low Lung (Lib8 > Lib9) 


307 


79 


27 


4.088903 


62 


High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


74 


High Lung > Low Lung (Lib8 > Lib9) 


6268 


5 


0 


6.987366 


106 


High Lung > Low Lung (Lib8 > Lib9) 


10717 


8 


b 


11.17978 


119 


High Lung > Low Lung (Lib8 > Lib9) 


8 


1355 


122 


15.52111 


361 


High Lung > Low Lung (Lib8 > Lib9) 


1120 


5 


0 


6.987366 


369 


High Lung > Low Lung (Lib8 > Lib9) 


2790 


6 


0 


8.384840 


371 


High Lung > Low Lung (Lib8 > Lib9) 


8847 


6 


1 


8.384840 


379 


High Lung > Low Lung (Lib8 > Lib9) 


260 


15 


0 


20.96210 


395 


High Lung > Low Lung (Lib8 > Lib9) 


13538 


9 


1 


12.57726 


135 


Low Lung > High Lung (Lib9 > Lib8) 


36313 


30 


1 


21.46731 


154 


Low Lung > High Lung (Lib9 > Lib8) 


5345 


27 


6 


3.220097 


160 


Low Lung > High Lung (Lib9 > Lib8) 


4386 


21 


3 


5.009039 


260 


Low Lung > High Lung (Lib9 > Lib8) 


4141 


27 


4 


4.830145 


308 


Low Lung > High Lung (Lib9 > Lib8) 


15855 


213 


12 


12.70149 


323 


Low Lung > High Lung (Lib9 > Lib8) 


5257 


25 


5 


3.577885 


349 


Low Lung > High Lung (Lib9 > Lib8) 


2797 


14 


1 


10.01807 


381 


Low Lung > High Lung (Lib9 > Lib8) 


2428 


19 


2 


6.797982 



Example 7: Polynucleotides Differentially Expressed in High Metastatic Potential Colon 
Cancer Cells Versus Low Metastatic Colon Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
20 expressed between cells derived from high metastatic potential colon cancer tissue and low 
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metastatic colon cancer cells. Expression of these sequences in colon cancer tissue can be 

valuable in determining diagnostic, prognostic and/or treatment information. For example, 

sequences that are highly expressed in the high metastatic potential cells can be indicative of 

increased expression of genes or regulatory sequences involved in the metastatic process. A 

patient sample displaying an increased level of one or more of these polynucleotides may 

thus warrant more aggressive treatment. In another example, sequences that display higher 

expression in the low metastatic potential cells can be associated with genes or regulatory 

sequences that inhibit metastasis, and thus the expression of these polynucleotides in a 

sample may warrant a more positive prognosis than the gross pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and low metastatic potential 
colon cancer cells: 

Table 11: Differentially expressed polynucleotides: High metastatic potential colon 
cancer vs. low metastatic colon cancer cells 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


I s ' 






1 


High Colon > Low Colon (Libl > Lib2) 




Library 


Library 




6660 


7 


0 


6.489973 


176 


High Colon > Low Colon (Libl > Lib2) 


3765 


19 


6 


2.935940 


241 


High Colon > Low Colon (Libl > Lib2) 


4275 


11 


2 


5.099264 


362 


High Colon > Low Colon (Libl > Lib2) 


6420 


8 


0 


7.417112 


374 


High Colon > Low Colon (Libl > Lib2) 


6420 


8 


0 


7.417112 


39 


Low Colon > High Colon (Lib2 > Libl) 


4016 


14 


5 


3.020043 


97 


Low Colon > High Colon (Lib2 > Libl) 


945 


21 


9 


2.516702 


134 


Low Colon > High Colon (Lib2 > Libl) 


2464 


19 


5 


4.098630 


317 


Low Colon > High Colon (Lib2 > Libl ) 


1577 


40 


12 


3.595289 


357 


Low Colon > High Colon (Lib2 > Libl) 


4309 


13 


4 


3.505407 



20 Example 8: Polynucleotides Differentially Expressed at Higher Levels in High Metastatic 

Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue 

A number of polynucleotide sequences have been identified that are differentially 

expressed between cells derived from high metastatic potential colon cancer tissue and 

normal tissue. Expression of these sequences in colon cancer tissue can be valuable in 

determining diagnostic, prognostic and/or treatment information. For example, sequences 

that are highly expressed in the high metastatic potential cells are associated can be 
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indicative of increased expression of genes or regulatory sequences involved in the advanced 

disease state which involves processes such as angiogenesis, dedifferentiation, cell 

replication, and metastasis. A patient sample displaying an increased level of one or more of 

these polynucleotides may thus warrant more aggressive treatment. 

5 The differential expression of these polynucleotides can be used as a diagnostic 

marker, a prognostic marker, for risk assessment, patient treatment and the like. These 

polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 

10 expression between high metastatic potential colon cancer cells and normal colon cells: 



Table 11: Differentially expressed polynucleotides: High metastatic potential colon 
tissue vs. normal colon tissue 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


I s * 
Library 


2 nd 
Library 




52 


High Colon Metastasis Tissue > Normal 
Colon Tissue of UC#3 (Lib20 > Lib 18) 


19 


10 


0 


1 1.69918 


52 


High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl 7 > Libl 5) 


19 


13 


2 


6.025646 


172 


High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl 7 > Libl5) 


102 


65 


22 


2.738930 



15 Example 9: Polynucleotides Differentially Expressed at Higher Levels in High Colon 
Tumor Potential Patient Tissue Versus Metastasized Colon Cancer Patient 
Tissue 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and cells 

20 derived from high metastatic potential colon cancer cells. Expression of these sequences in 
colon cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment 
information associated with the transformation of precancerous tissue to malignant tissue. 
This information can be useful in the prevention of achieving the advanced malignant state 
in these tissues, and can be important in risk assessment for a patient. 

25 The following table summarizes identified polynucleotides with differential 

expression between high tumor potential colon cancer tissue and cells derived from high 
metastatic potential colon cancer cells: 
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Table 12: Differentially expressed polynucleotides: High tumor potential colon tissue 

vs. metastatic colon tissue 



SEQID 
NO. 

52 

119 
172 



Differential Expression 

High Colon Tumor Tissue > Metastasis 
Tissue of UC#3 (Lib 19 > Lib20) 
High Colon Tumor Tissue > Metastasis 
Tissue of UC#3 (Lib 19 > Lib20) 
High Colon Tumor Tissue > Metastasis 
Tissue of UC#3 (Lib 19 > Lib20) 



Cluster 
ID 

19 
8 



Clones in 1 st 
Library 

69 
14 



Clones in 2 
Library 

10 
1 



nd 



Ratio 



102 



43 



10 



5.160829 



10.47124 



3.216368 



5 Example 10: Polynucleotides Differentially Expressed at Higher Levels in High Tumor 
Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and normal 
tissue. Expression of these sequences in colon cancer tissue can be valuable in determining 
1 0 diagnostic, prognostic and/or treatment information associated with the prevention of 

achieving the malignant state in these tissues, and can be important in risk assessment for a 
patient. For example, sequences that are highly expressed in the potential colon cancer cells 
are associated with or can be indicative of increased expression of genes or regulatory 
sequences involved in early tumor progression. A patient sample displaying an increased 
15 level of one or more of these polynucleotides may thus warrant closer attention or more 
frequent screening procedures to catch the malignant state as early as possible. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and normal colon cells: 

20 Table 13: Differentially expressed polynucleotides: High tumor potential colon tissue 
vs. normal colon tissue 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


I s ' 






52 


High Colon Tumor Tissue > Normal Tissue 
of UC#2 (Libl 6 > Lib 15) 




Library 


Library 




19 


13 


2 


6.255508 


288 


High Colon Tumor Tissue > Normal Tissue 


1267 


7 


0 


6.125253 




ofUC#2 (Libl 6 > Lib 15) 








52 


High Colon Tumor Tissue > Normal Tissue 


19 


69 


0 


60.37750 




ofUC#3 (Libl 9 > Lib 18) 








119 


High Colon Tumor Tissue > Normal Tissue 
ofUC#3 (Libl9>Libl8) 


8 


14 


1 


12.25050 


172 


High Colon Tumor Tissue > Normal Tissue 


102 


43 


7 


5.375222 



ofUC#3 (Libl9>Libl8) 

114 



BNSDOCID: <WO 9933982A2_L> 



WO 99/33982 PCT/US98/27610 

Example 1 1 : Polynucleotides Differentially Expressed Across Multiple Libraries 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cancerous cells and normal cells across all three tissue types tested (i.e., 
5 breast, colon, and lung). Expression of these sequences in a tissue or any origin can be 

valuable in determining diagnostic, prognostic and/or treatment information associated with 
the prevention of achieving the malignant state in these tissues, and can be important in risk 
assessment for a patient. These polynucleotides can also serve as non-tissue specific 
markers of, for example, risk of metastasis of a tumor. The following table summarizes 
10 identified polynucleotides that were differentially expressed but without tissue type- 
specificity in the breast, colon, and lung libraries tested. 



Table 14: Polynucleotides Differentially Expressed Across Multiple Library Comparisons 




t~\ i f*fV» r*o n t i •) 1 Fvnroccinn 

jjiiitrciiii4i jLApr^Mun 




V^-HJIICa 111 


■ * Inn pc ■ n 
I^IUIIO III 


Rutin 




in 


X 


It 










muWmtat y 


T ihrorv 

Liiuidi y 




Q 


Miah Rrpast > I r»w Rrf*fl<;t (1 ih^ > I ih4^ 
n i tii i DicaM LiUw uicaoi ^l^iuj - — <i j 


2623 


3 1 


4 


7 561356 




I_I \ ciY\ 1 una "> I r\\x/ I una (\ ihR "> T ihQ^ 

nign L/Ung low i^uug ^biuo ) 


9691 


A 
\J 


1 


8 384R40 

O .-J o*t t>*rv/ 






AM 6 


f. 


A 
\) 


6 1406Q0 








14 


5 


3 020043 


42 


Hi^h Rrea«5t > I r>w Breast (\ ih3 > 1 itvH 


307 


196 


75 


2.549721 




High Lung > Low Lung (Lib8 > Lib9) 


307 


79 


27 


4.088903 


52 


High Breast > Low Breast (Lib3 > Lib4) 


19 


1364 


525 


2.534854 




High Colon Metastasis Tissue > Normal 


19 


10 


0 


1 1.69918 




Colon Tissue of UC#3 (Lib20 > Libl 8) 












High Colon Metastasis Tissue > Normal 


19 


13 


2 


6.025646 




Tissue in UC#2 (Libl 7 > Libl 5) 












High Colon Tumor Tissue > Metastasis 


19 


69 


10 


5.160829 




Tissue of UC#3 (Libl 9 > Lib20) 












High Colon Tumor Tissue > Normal Tissue 


19 


13 


2 


6.255508 




ofUC#2 (Libl 6 > Libl 5) 












High Colon Tumor Tissue > Normal Tissue 


19 


69 


0 


60.37750 




ofUC#3 (Libl 9 > Libl 8) 










62 


High Breast > Low Breast (Lib3 > Lib4) 


2623 


31 


4 


7.561356 




High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


74 


High Lung > Low Lung (Lib8 > Lib9) 


6268 


5 


0 


6.987366 




Low Breast > High Breast (Lib4 > Lib3) 


6268 


18 


3 


6.149690 


119 


High Colon Tumor Tissue > Metastasis 


8 


14 


1 


10.47124 




Tissue of UC#3 (Lib 19 > Lib20) 












High Colon Tumor Tissue > Normal Tissue 


8 


14 


1 


12.25050 




ofUC#3 (Libl9> Libl 8) 












High Lung > Low Lung (Lib8 > Lib9) 


8 


1355 


122 


15.521 11 


172 


High Breast > Low Breast (Lib3 > Lib4) 


102 


278 


116 


2.338217 




High Colon Metastasis Tissue > Normal 


102 


65 


22 


2.738930 




Tissue in UC#2 (Libl 7 > Libl 5) 












High Colon Tumor Tissue > Metastasis 


102 


43 


10 


3.216168 
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SEQ ID 
NO. 


Differential Expression Cluster Clones in 

ID 1 st 


Clones in 

L 


Ratio 




Tissue of UC#3 (Lib 19 > Lib20) 

High Colon Tumor Tissue > Normal Tissue 102 

ofUC#3 (Libl9> Libl8) 


Library 

43 


Library 

7 


5.375222 


1 1 7 


High Breast > Low Breast (Lib3 > Lib4) 1 577 
Low Colon > High Colon (Lib2 > Libl) 1577 


25 
40 


3 

12 


8.130490 
3.595289 


379 


High Breast > Low Breast (Lib3 > Lib4) 260 
High Lung > Low Lung (Lib8 > Lib9) 260 


27 
15 


2 
0 


13.17139 
20.96210 


Example 


■ 12: Polynucleotides Exhibiting Colon-Specific 


Expression 







The cDNA libraries described herein were also analyzed to identify those 
polynucleotides that were specifically expressed in colon cells or tissue, Le. 9 the 
5 polynucleotides were identified in libraries prepared from colon cell lines or tissue, but not 
in libraries of breast or lung origin. The polynucleotides that were expressed in a colon cell 
line and/or in colon tissue, but were present in the breast or lung cDNA libraries described 
herein, are shown in Table 15. 



10 Table 15 Polynucleotides specifically expressed in colon cells. 



SEQ ID 
NO. 



Cluster Clones in Clones in SEQ ID 
1 st Library 2 nd Library NO. 



Cluster Clones in 



Clones in 
I s ' Library 2 nd Library 



5 


36535 


2 


0 


229 


39648 


2 


0 


13 


27250 


2 


0 


231 


85064 


1 


0 


19 


16283 


3 


0 


234 


39391 


2 


0 


24 


16918 


4 


0 


236 


39498 


2 


0 


26 


40108 


2 


0 


242 


22113 


3 


0 


32 


32663 


1 


1 


247 


19255 


2 


0 


43 


39833 


2 


0 


252 


22814 


3 


0 


47 


18957 


3 


0 


253 


39563 


2 


0 


48 


39508 


2 


0 


254 


39420 


2 


0 


56 


7005 


8 


2 


257 


39412 


2 


0 


58 


18957 


3 


0 


261 


38085 


2 


0 


59 


18957 


3 


0 


265 


40054 


1 


0 


60 


16283 


3 


0 


266 


39423 


2 


0 


64 


13238 


4 


1 


267 


39453 


2 


0 


70 


39442 


2 


0 


270 


78091 


1 


0 


71 


17036 


4 


0 


276 


39168 


2 


0 


73 


7005 


8 


2 


277 


39458 


2 


0 


83 


11476 


6 


0 


278 


14391 


3 


1 


86 


39425 


2 


0 


279 


39195 


2 


0 


94 


21847 


2 


1 


282 


12977 


5 


0 


100 


16731 


3 


1 


284 


14391 


3 


1 


101 


12439 


4 


0 


290 


16347 


4 


0 


113 


17055 


4 


0 


293 


39478 


2 


0 


120 


67907 


1 


0 


294 


39392 


2 


0 


121 


12081 


4 


0 


297 


39180 


2 


0 


124 


39174 


2 


0 


299 


6867 


7 


3 
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SEQ ID 


Cluster 


Clones in Clones in 


SEQ ID 


Cluster 


Clones in 




NO. 




1 


Library 2 Library 


NO. 




1 Library 


2 


126 


8210 


2 


6 


301 


41633 


1 


1 


128 


40455 


2 


0 


302 


23218 


3 


0 


339 


22195 


*> 
3 


A 


O ~> 

303 


39380 


2 


(J 


143 


o s o r c\ 

86859 


I 


n 
U 


309 


84328 


1 


U 


150 


8672 


4 


4 


314 


14367 


-> 
3 


r\ 

0 


153 


16977 


4 


a 
U 


320 


39886 


2 


u 


156 


1 7036 


4 


A 

u 


324 


9061 


5 




159 


A t\f\ A A 

40044 


Z 


A 
U 


*5 O *7 

327 


16653 


-> 
3 


1 

1 


161 


40044 


I 


A 


328 


1 6985 


4 


0 


163 


22155 


3 


U 


329 


1 2977 


5 


0 


166 


1 c r\ f /" 

15066 


A 

4 


U 


330 


9061 


5 




170 


1 1465 


5 


U 


333 


16392 


3 


0 


176 


3765 


19 


6 


342 


39486 


2 


0 


181 


861 10 


1 


A 


344 


6874 


6 


3 


182 


39648 


2 


0 


345 


O "7 A 

6874 


6 


~\ 

-) 


185 


17076 


A 

4 


0 


353 


1 1494 


4 


0 


186 


22794 


2 


0 


354 


17062 


3 


0 


187 


39171 


2 


0 


355 


16245 


4 


0 


194 


40455 


2 


0 


356 


83 103 


1 


0 


199 


16317 


3 


0 


358 


13072 


4 


1 


210 


39186 


2 


0 


366 


14364 


1 


0 


21 1 


40122 


2 


0 


368 


84182 


1 


0 


218 


26295 


2 


0 


372 


56020 


1 


0 


222 


4665 


5 


9 


389 


7514 


5 


3 


226 


82498 


1 


0 


391 


7570 


5 


3 


227 


35702 


2 


0 


393 


23210 


3 


0 
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Clones in 
nd Library 



In addition to the above, SEQ ID NOS: 159 and 161 were each present in one clone in 
each of Lib 16 (Normal Colon Tumor Tissue), and SEQ ID NOS:344 and 345 were each 
present in one clone in Libl 7 (High Colon Metastasis Tissue). No clones corresponding to 
the colon-specific polynucleotides in the table above were present in any of Libraries 3, 4, 8, 
or 9. The polynucleotide provided above can be used as markers of cells of colon origin, and 
find particular use in reference arrays, as described above. 



Example 13: Identification of Contiguous Sequences Having a Polynucleotide of the 

10 Invention 

The novel polynucleotides were used to screen publicly available and proprietary 
databases to determine if any of the polynucleotides of SEQ ID NOS: 1-404 would facilitate 
identification of a contiguous sequence, e.g., the polynucleotides would provide sequence 
that would result in 5' extension of another DNA sequence, resulting in production of a 

15 longer contiguous sequence composed of the provided polynucleotide and the other DNA 
sequence(s). Contiging was performed using the AssemblyLign program with the following 
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parameters: 1) Overlap: Minimum Overlap Length: 30; % Stringency: 50; Minimum 
Repeat Length: 30; Alignment: gap creation penalty: 1.00, gap extension penalty: LOO; 2) 
Consensus: % Base designation threshold: 80. 

Using these parameters, 44 polynucleotides provided contiged sequences. These 
5 contiged sequences are provided as SEQ ID NOS:80 1-844. The contiged sequences can be 
correlated with the sequences of SEQ ID NOS: 1-404 upon which the contiged sequences are 
based by identifying those sequences of SEQ ID NOS: 1-404 and the contiged sequences of 
SEQ ID NOS: 80 1-844 that share the same clone name in Table 1 . It should be noted that of 
these 44 sequences that provided a contiged sequence, the following members of that group 
10 of 44 did not contig using the overlap settings indicated in parentheses (Stringency/Overlap): 
SEQ ID NO:804 (30%/ 10); SEQ ID NO:810 (20%/20); SEQ ID NO:812 (30%/10); SEQ ID 
NO:814 (40%/20); SEQ ID NO:816 (30%/10); SEQ ID NO:832 (30%/10); SEQ ID NO:840 
(20%/20); SEQ ID NO:841 (40%/20). To generalize, the indicated polynucleotides did not 
contig using a minimum 20% stringency, 10 overlap. There was a corresponding increase in 
1 5 the number of degenerate codons in these sequences. 

The contiged sequences (SEQ ID NO:80 1-844) thus represent longer sequences that 
encompass a polynucleotide sequence of the invention. The contiged sequences were then 
translated in all three reading frames to determine the best alignment with individual 
sequences using the BLAST programs as described above for SEQ ID NOS: 1-404 and the 
20 validation sequences SEQ ID NOS:405-800. Again the sequences were masked using the 
XBLAST profram for masking low complexity as described above in Example 1 (Table 2). 
Several of the contiged sequences were found to encode polypeptides having characteristics 
of a polypeptide belonging to a known protein families (and thus represent new members of 
these protein families) and/or comprising a known functional domain (Table 16). Thus the 
25 invention encompasses fragments, fusions, and variants of such polynucleotides that retain 
biological activity associated with the protein family and/or functional domain identified 
herein. 
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Table 16. Profile hits using contiged sequences 



SEQ 

ID 

NO. 


Sequence Name 


Profile 


Start 
(Stop) 


Score 


809 


Contig RTA00000177AF.n.l8.3. 
Seq_THC 123051 


ATPases 


778 
(1612) 


6040 


824 


Contig RTA00000187AF.g.24.1. 
Seq_THC 168636 


homeobox 


531 
(707) 


12080 


824 


Contig RTA00000187AF.g.24.1. 
Seq_THC 168636 


MAP kinase kinase 


769 
(1494) 


5784 [ 


833 


Contig RTA00000190AFJ.4.1. 
Seq_THC228776 


protein kinase 


170 
(1010) 


5027 


833 


Contig RTA00000190AF.j.4.1. 
Seq_THC228776 


protein kinase 


170 
(1010) 


5027 1 



All stop/start sequences are provided in the forward direction. 



5 The profiles for the ATPases (AAA) and protein kinase families are described above 

in Example 2. The homeobox and MAP kinase kinase protein families are described further 
below. 

Homeobox domain. The 'homeobox' is a protein domain of 60 amino acids (Gehring 
In: Guidebook to the Homeobox Genes , Duboule D., Ed., ppl-10, Oxford University Press, 

1 0 Oxford, (1994); Buerglin In: Guidebook to the Homeobox Genes , pp25-72, Oxford 

University Press, Oxford, (1994); Gehring Trends Biochem. Sci. (1992) 77:277-280; Gehring 
etalAnnu. Rev. Genet (1986) 20:147-173; Schofield Trends Neurosci. (1987) 70:3-6; 
http://copan.bioz.unibas.ch/ homeo.html) first identified in number of Drosophila homeotic 
and segmentation proteins. It is extremely well conserved in many other animals, including 

1 5 vertebrates. This domain binds DNA through a helix-turn-helix type of structure. Several 
proteins that contain a homeobox domain play an important role in development. Most of 
these proteins are sequence-specific DNA-binding transcription factors. The homeobox 
domain is also very similar to a region of the yeast mating type proteins. These are 
sequence-specific DNA-binding proteins that act as master switches in yeast differentiation 

20 by controlling gene expression in a cell type-specific fashion. 

A schematic representation of the homeobox domain is shown below. The helix- 
turn-helix region is shown by the symbols 'H' (for helix), and T (for turn). 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHtttHHHHHHHHHxxxxxxxxxx 
25 1 60 
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The pattern detects homeobox sequences 24 residues long and spans positions 34 to 57 of the 

homeobox domain. The consensus pattern is as follows: [LIVMFYG]-[ASLVR]-x(2)- 

[LIVMSTACN]-x-[LIVM]-x(4).[LIV]-[RKNQESTAIY]-[LIVFSTO 
[NDQTAH]-x(5)-[RKNAIMW] . 

5 MAP kinase kinase fMAPKKV MAP kinases (MAPK) are involved in signal 

transduction, and are important in cell cycle and cell growth controls. The MAP kinase 
kinases (MAPKK) are dual-specificity protein kinases which phosphorylate and activate 
MAP kinases. MAPKK homologues have been found in yeast, invertebrates, amphibians, 
and mammals. Moreover, the MAPKK/MAPK phosphorylation switch constitutes a basic 
10 module activated in distinct pathways in yeast and in vertebrates. MAPKK regulation 
studies have led to the discovery of at least four MAPKK convergent pathways in higher 
organisms. One of these is similar to the yeast pheromone response pathway which includes 
the stel 1 protein kinase. Two other pathways require the activation of either one or both of 
the serine/threonine kinase-encoded oncogenes c-Raf-1 and c-Mos. Additionally, several 
15 studies suggest a possible effect of the cell cycle control regulator cycl in-dependent kinase 1 
(cdc2) on MAPKK activity. Finally, MAPKKs are apparently essential transducers through 
which signals must pass before reaching the nucleus. For review, see, e.g., Biologique Biol 
Cell (1993) 79:193-207; Nishida et ah, Trends Biochem Sci (1993) 75:128-31; Ruderman 
Curr Opin Cell Biol (1993) 5:207-13; Dhanasekaran et al, Oncogene (1998) 77:1447-55; 
20 Kiefer et aL, Biochem Soc Trans (1997) 25:491-8; and Hill, Cell Signal (1996) 5:533-44. 

Those skilled in the art will recognize, or be able to ascertain, using not more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such specific embodiments and equivalents are intended to be 
encompassed by the following claims. 
25 All publications and patent applications cited in this specification are herein 

incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
30 prior invention. 
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Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it is readily apparent to 

those of ordinary skill in the art in light of the teachings of this invention that certain changes 

and modifications may be made thereto without departing from the spirit or scope of the 

appended claims. 



Deposit Information : 

The following materials were deposited with the American Type Culture Collection: 
CMCC = (Chiron Master Culture Collection) 

Cell Lines Deposited with ATCC 



Cell Line 


Deposit Date 


ATCC Accession No. 


CMCC Accession No. 


KM12L4-A 


March 19, 1998 


CRL- 12496 


11606 


Kml2C 


May 15, 1998 


CRL-12533 


11611 


MDA-MB-231 


May 15, 1998 


CRL-12532 


10583 


MCF-7 


October 9, 1998 


CRL-12584 


10377 
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CDNA Library Deposits 

cDNA Library ES I - ATCC# 
Deposit Date - December 22, 1998 
Clone Name — ~~ 

M00001395A:C03 
M00001395A:C03 
M00001449A:D12 
M00001449A.D12 
M00001452A:D08 
M00001452A:D08 
M00001513A:B06 
M00001513A:B06 
M00001517A:B07 
M00001517A:B07 
M00001533A:C11 
M00001533A:C11 
M00001533A:C11 
M00001542A:A09 
MOOOO1542A:A09 

cDNA Library ES2 - ATCC# 
Deposit Date - December 22, 1998 



PO7US98/27610 



Cluster ID 


Sequence Name 


4016 


79.Al.sp6:130016.Seq 


4016 


RTA00000118A.C.4.1 


3681 


RTA00000131A.g.l5.2 


O O 1 

3681 


79.El.sp6:130064.Seq 


1 1 in 

1 120 


79.C2.sp6: 1 3004 1 .Seq 


1 120 


RTA00000118A.p.l5.3 


4568 


79.D4.sp6:130055.Seq 


4568 


RTA00000122A.d.l5.3 


4313 


79.F4.sp6:130079.Seq 


4313 


RTA00000122A.n.3.1 


2428 


RTA00000123A.1.21.1 


2428 


79.A5.sp6:130020.Seq 


2428 


RTA00000123A.1.21.1.Seq_THC205063 


22113 


79.F5.sp6:130080.Seq 


22113 


RTA00000125A.C.7.1 



Clone Name 


Cluster ID 


M00001343C:F10 


2790 


M00001343C:F10 


2790 


M00001343C:F10 


2790 


M00001343D:H07 


23255 


M00001343D:H07 


23255 


M00001343D.H07 


23255 


M00001343D:H07 


23255 


M00001345A:E01 


6420 


M00001345A:E01 


6420 


M00001345A:E01 


6420 


M00001345A:E01 


6420 


M00001347A:B10 


13576 


M00001347A:B10 


13576 


M00001347A:B10 


13576 


M00O01353A:G12 


8078 


M00001353A:G12 


8078 


M00001353A:G12 


8078 


M00001353D:D10 


14929 


M00O01353D:D10 


14929 


MO0OO1353D:D10 


14929 


M00001361A:A05 


4141 


MO0OO1361A:A05 


4141 


M00001362B:D10 


5622 


M00001362B:D10 


5622 



Sequence Name 

80.El.sp6:130256.Seq 

RTA000001 77AF.e.2. 1 .Seq_THC22946 1 

RTA00000177AF.e.2.1 

100.Cl.sp6:131446.Seq 

RTA000001 77AF.e. 1 4.3 .Seq_THC228776 

80.Fl.sp6:130268.Seq 

RTA00000177AF.e.l4.3 

172.El.sp6:133925.Seq 

RTA00000177AF.f.l0.3 

RTA00000 1 77 AF.f. 10.3. Seq_THC226443 

80.Gl.sp6:130280.Seq 

80.D2.sp6:130245.Seq 

100.El.sp6:131470.Seq 

RTA00000177AF.g.l6.1 

80.E3.sp6:130258.Seq 

RTA00000177AR.1.13.1 

172.C3.sp6:133903.Seq 

RTA00000177AF.m.l.2 

80.F3.sp6:130270.Seq 

172.D3.sp6:133915.Seq 

80.B4.sp6:130223.Seq 

RTA00000 1 77AF.p.20.3 

80.D4.sp6:130247.Seq 

RTA00000178AF.a.ll.l 
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cDNA Library ES3 - ATCC# 
Deposit Date - December 22, 



1998 



PC17US98/27610 



Clone Name 

M00001362C:H1 1 

M00001362C:H1 1 

M00001362C:H11 

M00001362C:H11 

M00001376B:G06 

M00001376B:G06 

M00001387A:C05 

M00001387A:C05 

M00001412B:B10 

M00001412B:B10 

M00001415A:H06 

M00001415A:H06 

M00001416B:H1 1 

M00001416B:H11 

M00001429D:D07 

M00001429D:D07 

M00001448D:H01 

M00001448D:H01 



Cluster ID 


Sequence Name 


945 


RTA000001 78AR.a.20. 1 


945 


100.E4.sp6:131473.Seq 


945 


80.E4.sp6:130259.Seq 


945 


180.C2.sp6:135940.Seq 


17732 


RTA00000178AR.i.2.2 


17732 


80.B5.sp6:130224.Seq 


2464 


80.D6.sp6:130249.Seq 


2464 


RTA00000178AF.n.l8.1 


8551 


RTA00000179AF.p.21.1 


8551 


80G7.sp6:130286.Seq 


13538 


80.B8.sp6:130227.Seq 


13538 


RTA00000180AF.a.24.1 


8847 


80.C8.sp6:130239.Seq 


8847 


RTA00000180AF.b.l6.1 


40392 


RTA00000180AF.j.8.1 


40392 


80.H9.sp6:130300.Seq 


36313 


80.All.sp6:130218.Seq 


36313 


RTA00000181AF.e.23.1 



cDNA Library ES4 - ATCC# 
Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


Sequence Name 


M00001463C:B11 


19 


RTA00000182AF.b.7.1 


M00001463C:B11 


19 


89.Dl.sp6:130703.Seq 


M00001470A:B10 


1037 


89.F2.sp6:130728.Seq 


M00001470A:B10 


1037 


RTA00000121A.f.8.1 


M00001497A:G02 


2623 


89.F3.sp6:130729.Seq 


M00001497A:G02 


2623 


RTA00000183AF.a.6.1 


M00001500A:E11 


2623 


RTA00000183AF.b.l4.1 


M00001500A:E11 


2623 


89.A4.sp6:130670.Seq 


M00001501D:C02 


9685 


RTA00000 1 83 AF.c. 11.1 .Seq_THC 1 09544 


M00001501D:C02 


9685 


RTA00000183AF.C.11.1 


M00001501D.C02 


9685 


89.C4.sp6:130694.Seq 


M0OO01504C:HO6 


6974 


89.F4.sp6:130730.Seq 


M0OO01504C:H06 


6974 


RTA00000183AF.d.9.1 


M00001504C:H06 


6974 


RTA00000 1 83 AF.d.9. 1 .Seq_THC223 1 29 


M00001504D:G06 


6420 


173.F5.SP6:134133.Seq 


M00001504D:G06 


6420 


89.G4.sp6:130742.Seq 


M00001504D:G06 


6420 


RTAO00O0 1 83 AF.d. 11.1 .Seq_THC226443 


M00001504D:G06 


6420 


RTA00000183AF.d.ll.l 


M00001528A:C04 


35555 


89.B6.sp6:130684.Seq 


M00001528A:C04 


7337 


RTA00000123A.b.l7.1 


M00001528A:C04 


35555 


184.A5.sp6:135530.Seq 
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cDNA Library ES5 - ATCC# 
Deposit Date - December 22, 1998 



PCT/US98/27610 



Clone Name 

M00001537B:G07 

M00001537B:G07 

M00001541A:D02 

M00001541A:D02 

M00001544B:B07 

M00001544B:B07 

M00001546A:G11 

M00001546A:G11 

M00001549B:F06 

M00001549B:F06 

M00001556A:F11 

M00001556A:F11 

M00001556A:F11 

M00001556B:C08 

M00001556B:C08 



cDNA Library ES6 - ATCC# 
Deposit Date - December 22, 1998 



Cluster ID 


Sequence Name 


3389 


RTA00000183AF.m.l9.1 


3389 


89A8.sp6:130674.Seq 


3765 


89.C8.sp6:130698.Seq 


3765 


RTA00000135A.d.l.l 


6974 


89.A9.sp6:130675.Seq 


6974 


RTA00000184AF.a.l5.1 


1267 


89.D9.sp6:130711.Seq 


1267 


RTA00000125A.O.5.1 


4193 


89.G9.sp6:130747.Seq 


4193 


RTA00000184AF.e.l3.1 


1577 


173.C9.SP6:134101.Seq 


1577 


89.Fll.sp6:130737.Seq 


1577 


RTA00000184AF.i.23.1 


4386 


RTA00000184AF.j.4.1 


4386 


89.Hll.sp6:130761.Seq 



Clone Name 


Cluster ID 


M00001563B:F06 


102 


M00001563B:F06 


102 


M00001571C:H06 


5749 


M00001571C:H06 


5749 


M00001594B:H04 


260 


M00001594B:H04 


260 


MO0O01597C:H02 


4837 


M00001597C:H02 


4837 


M00001624C:F01 


4309 


M00001624C:F01 


4309 


M00001679A:A06 


6660 


M00001679A:A06 


6660 


M00001679A:A06 


6660 


M00003759B:B09 


697 


M00003759B:B09 


697 


M00003759B:B09 


697 


M00003844C:B11 


6539 


M00003844C:B11 


6539 


M00003844C:B11 


6539 


M00003857A:G10 


3389 


MO0OO3857A:G10 


3389 



Sequence Name 

RTA00000184AF.O.5.1 

90.Bl.sp6:130871.Seq 

90.El.sp6:130907.Seq 

RTA00000185AF.a.l9.1 

90.D2.sp6:130896.Seq 

RTA00000185AR.U2.2 

90.E2.sp6:130908.Seq 

RTA00000185AR.k.3.2 

90.C4.sp6:130886.Seq 

RTA00000186AF.e.22.1 

90.F6.sp6:130924.Seq 

122.B5.sp6:132089.Seq 

RTA00000187AF.h.l5.1 

90.G8.sp6:130938.Seq 

RTA00000188AF.d.6.1 

RTA000001 88AF.d.6. 1 .Seq_THC 1 78884 

176.D9.sp6:134556.Seq 

RTA00000189AF.d.22.1 

90.B10.sp6:130880.Seq 

90.All.sp6:130869.Seq 

RTA00000189AF.g.3.1 
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w 


J &*& Pf^T/IKQR/lTfilO 


cDNA Library ES7 


- ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00003914C:F05 


3900 


99.El.sp6:13 1278. Seq 


M00003914C:F05 


3900 


RTAOOOOO 1 90 AF.g. 13.1 


M00003922A:E06 


23255 


RTA00000 1 90AF.j.4. 1 


M00003922A:E06 


23255 


99.Fl.sp6:131290.Seq 


M00003922A:E06 


23255 


RTAOOOOO 1 90AF.J.4. 1 .Seq_THC228776 


MOO0O3983A:A05 


9105 


99.C3.sp6:131256.Seq 


M00003983A:A05 


9105 


RTAOOOOO 1 9 1 AF.a.2 1 .2 


M00004028D:A06 


6124 


RTAOOOOO 1 9 1 AR.e.2.3 


M00004028D:A06 


6124 


99.D3.sp6: 131 268. Seq 


M00004031A:A12 


9061 


RTAOOOOO 1 9 1 AR.e. 1 1 .2 


M00004031A:A12 


9061 


RTAOOOOO 1 9 1 AR.e. 1 1 .3 


M00004087D:A01 


6880 


RTAOOOOO 1 9 1 AF.m.20. 1 


M00004087D:A01 


6880 


99.A5.sp6:13 1234. Seq 


M00004108A:E06 


4937 


99.E5.sp6: 13 1282. Seq 


M00004108A:E06 


4937 


RTAOOOOO 1 9 1 AF.p.2 1 . 1 


M00004114C:F11 


13183 


1 23.D5.sp6: 132305.Seq 


M00004114C:Fll 


13183 


RTAOOOOO 1 92 AF.a.24.1 


M00004114C:F11 


13183 


99.G5.sp6:131306.Seq 


cDNA Library ES8 


- ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00004146C:C11 


5257 


99.B6.sp6: 13 1247. Seq 


M00004146C:C11 


5257 


177.F5.sp6:134768.Seq 


M00004146C:CU 


5257 


RTAOOOOO 1 92AF.0 . 1 


M00004146C:C11 


5257 


RTAOOOOO 1 92AF.f.3. 1 .Seq_THC2 1 3833 


M00004157C:A09 


6455 


RTAOOOOO 1 92AF.g.23 . 3 


M00004157C:A09 


6455 


99.D6.sp6: 13 1271. Seq 


M00004157C:A09 


6455 


1 23 .E7.sp6: 1 323 1 9. Seq 


M00004172C:D08 


11494 


RTAOOOOO 1 92AFJ.6. 1 


M00004172C:D08 


11494 


99.G6.sp6: 1 3 1 307. Seq 


M00004172C:D08 


11494 


1 77.E6.sp6: 1 34757. Seq 


M00004229B:F08 


6455 


RTAOOOOO 1 93 AF.b.9.1 


M00004229B:F08 


6455 


99.C8.sp6:131261.Seq 


cDNA Library ES9 


- ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00001466A:E07 


4275 


RTAOOOOO 1 20A.J . 1 4. 1 


M00001531A:H11 




89.F6.sp6: 1 30732. Seq 


M00001531A:H11 




RTAOOOOO 1 23 A.g. 1 9. 1 


M00001551A:B10 


6268 


79.G9.sp6:l30O96.Seq 


M00001551A:B10 


6268 


l84.Cl2.sp6:l3556l.Seq 


MO0OO1551A:B10 


6268 


RTAOOOOO 1 26 A.o.23 . 1 


M00001552A:B12 


307 


RTAOOOOO 136A.0.4.2 


MO00O1552A:B12 


307 


79.C7.sp6:l30046.Seq 


M00001556A:H01 


15855 


RTAOOOOO 1 84AF.J. I . I 


MO00O1586C:CO5 


4623 


RTAOOOOO 1 85 AF.f.4. 1 


M00001604A:B10 


1399 


79.G8.sp6:l30095.Seq 


M00001604A:B10 


1399 


RTAOOOOO 1 29 A.o. 1 0. 1 


M00003879B:C11 


5345 


RTAOOOOO 1 89 AF. 1. 1 9. 1 


M00003879B:C11 


5345 


90.B12.sp6:130882.Seq 
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cDNA Library ES 1 0 - ATCC# 
Deposit Date - December 22, 1998 



PCT/US98/27610 



Clone Name 


Cluster ID 


Sequence Name 


M00001358C:C06 




RTA00000177AF.O.4.3 


M00001388D:G05 


5832 


80.F6.sp6:130273.Seq 


M00001388D:G05 


5832 


RTA00000 1 78AF.o.23 . 1 


M00001394A:F01 


6583 


RTA00000179AF.d.l3.1 


M00001394A:F01 


6583 


172.B8.sp6:133896.Seq 


M00001394A:F01 


6583 


80.H6.sp6: 1 30297.Seq 


M00001429A:H04 


2797 


RTA00000180AF.i.l9.1 


M00001447A:G03 


10717 


RTA00000181AF.d.l0.1 


M00001448D:C09 


8 


80.H 1 O.spo: 1 30301 .Seq 


M00001448D:C09 


8 


RTA00000181AF.e.l7.1 


M00001448D:C09 


8 


100.Bll.sp6:131444.Seq 


M00001454D:G03 


689 


RTA00000 1 8 1 AR.1.22. 1 


cDNA Library ESI 1 


- ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00003975A:G11 


12439 


RTA00000190AF.O.24.1 


M00003978B:G05 


5693 


RTA00000 1 90AF.p. 1 7.2.Seq_THC 1 733 1 8 


M00003978B:G05 


5693 


RTA00000 1 90AF.p. 17.2 


M00004059A:D06 


5417 


RTA00000191AF.h.l9.1 


M00004068B:A01 


3706 


99 C4 sn613 12^57 Sea 


M00004068B:A01 


3706 


RTA00000191AF.i.I7.2 


M00004205D:F06 




99.E7.sp6:131284.Seq 


M00004205D:F06 




177.G7.sp6:134782.Seq 


M00004205D:F06 




RTA00000 1 92AF.0. 11.1 


M00004212B:C07 


2379 


RTA00000192AF.p.8.1 


M00004223A:G10 


16918 


RTA00000193AF.3.16.1 


cDNA Library ESI 2 


- ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00004223B:D09 


7899 


RTA00000193AF.a.l7.1 


M00004249D:G12 




RTA00000193AF.C.22.1 


M00004251C:G07 




RTA00000193AF.d.2.1 


M00004372A:A03 


2030 


RTA00000193AF.m.20.1 



126 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 
cDNA Library ESI 3 - ATCC# 
Deposit Date - December 22, 1998 



PCT/US98/27610 



Clone Name 

M00001340B:A06 

M00001340B:A06 

M00001340D:F10 

M00001340D:F10 

M00001341A:E12 

M00001341A:E12 

M00001342B:E06 

M00001342B:E06 

M00001346A:F09 

M00001346A:F09 

M00001346D:G06 

M00001346D:G06 

M00001348B:B04 

M0000I348B:B04 

M00001348B:G06 

M00001348B:G06 

M00001349B:B08 

M00001349B:B08 

M00001350A:H01 

MOOO01350A:HOl 

M00001350A:H01 

M00001352A:E02 

M00001352A:E02 

M00001352A:E02 

M00001355B:G10 

M00001355B:G10 

MOOO01355B:G10 

M00001355B:G10 

M00001361D:F08 

M00001361D:F08 

M00001365C:C10 

M00001365C:C10 

M00001368D:E03 

M00001368D:E03 

M00001370A:C09 

M00001370A:C09 

M00001371C:E09 

M00001371C:E09 

M00001371C:E09 

M00001378B:B02 

M00001378B:B02 

MO0OO1379A:AO5 

M00001379A:A05 

M00001380D:B09 

M00001380D:B09 

M00001381D:E06 

M00001381D:E06 

M00001382C:A02 

M00001382C:A02 

M00001384B:A11 

M00001384B:A11 

M00001386C:B12 



Cluster ID 

17062 

17062 

11589 

11589 

4443 

4443 

39805 

39805 

5007 

5007 

5779 

5779 

16927 

16927 

16985 

16985 

3584 

3584 

7187 

7187 

7187 

16245 

16245 

16245 

14391 

14391 

14391 

14391 

2379 

2379 

40132 

40132 



6867 

6867 

7172 

7172 

7172 

39833 

39833 

1334 

1334 

39886 

39886 



22979 
22979 



5178 



Sequence Name 

80.Al.sp6:130208.Seq 

RTA00000177AF.D.8.4 

80.Bl.sp6:130220.Seq 

RTA00000177AF.b.l7.4 

80.Cl.sp6:130232.Seq 

RTA00000177AF.b.20.4 

80.Dl.sp6:130244.Seq 

RTA00000177AF.C.21.3 

RTA00000177AF.g.2.1 

80.Hl.sp6:130292.Seq 

RTA00000177AF.g.l4.3 

RTA00000177AF.g.l4.1 

80.E2.sp6:130257.Seq 

RTA00000177AF.h.9.3 

RTA00000177AF.h.l0.1 

80.F2.sp6:130269.Seq 

RTA00000177AF.h.20.1 

80.G2.sp6:130281.Seq 

100.C2.sp6:131447.Seq 

80.A3.sp6:13021O.Seq 

RTA00000177AF.i.8.2 

RTA00000177AF.k.9.3 

172.D2.sp6:133914.Seq 

80.D3.sp6:130246.Seq 

RTA0O0O0 1 77 AF .m. 1 7 .3 

80.G3.sp6:130282.Seq 

172.H3.sp6:133963.Seq 

100.E3.sp6:131472.Seq 

80.C4.sp6:130235.Seq 

RTA00000178AF.a.6.1 

RTA00000178AF.C.7.1 

80.F4.sp6:130271.Seq 

80.G4.sp6:l 30283. Seq 

RTA00000178AF.d.20.1 

80.H4.sp6:130295.Seq 

RTA00000178AF.e.l2.1 

100A5.sp6:131426.Seq 

RTA00000178AF.f.9.1 

80.A5.sp6:130212.Seq 

80.C5.sp6:130236.Seq 

RTA00000178AF.i.23.1 

80.D5.sp6:130248.Seq 

RTA00000178AF.j.7.1 

RTA00000178AFJ.24.1 

80.E5.sp6:130260.Seq 

80F5.sp6:130272.Seq 

RTA00000178AF.k.l6.1 

80.G5.sp6:130284.Seq 

RTA00000178AF.k.22.1 

80.B6.sp6: 130225. Seq 

RTA00000178AF.m.l3.1 

80.C6.sp6:130237.Seq 
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Clone Name 

M00001386C:B12 

M00001387B.-G03 

M00001387B:G03 

M00001389A:C08 

M00001389A:C08 

M00001396A:C03 

MO0001396A:C03 

M00001396A:C03 

M00001400B:H06 

M00001400B:H06 

M00001400B:H06 

M00001400B:H06 

M00001402A:E08 

M00001402A:E08 

M00001407B:D1 1 

M00001407B:D1 1 

M00001410A:D07 

M00001410A:D07 

M00001410A:D07 

M00001414A:B01 

M00001414A:B01 

M00001414C:A07 

M00001414C:A07 

M00001416A:H01 

M00001416A:H01 

M00001417A:E02 

M00001417A:E02 

M00001423B:E07 

M00001423B:E07 

M00001424B:G09 

M00001424B:G09 

M00001425B:H08 

M00001425B:H08 

M00001426B:D12 

M00001426B:D12 

M00001426D:C08 

M00001426D:C08 

M00001428A:H10 

M00001428A:H10 

M00001428A:H10 

M00001449A:A12 

M00001449A:A12 

M00001449A:B12 

M00001449A:B12 

M00001449A:G10 

M00001449A.G10 

M00001449A:G10 

M00001449C:D06 

M00001449C:D06 

M00001450A:A02 

M00001450A:A02 

M00001450A:A02 



Cluster ID 

5178 

7587 

7587 

16269 

16269 

4009 

4009 

4009 



39563 

39563 

5556 

5556 

7005 

7005 

7005 



7674 

7674 

36393 

36393 

15066 

15066 

10470 

10470 

22195 

22195 



4261 

4261 

84182 

84182 

84182 

5857 

5857 

41633 

41633 

36535 

36535 

36535 

861 iO 

861 10 

39304 

39304 

39304 



Sequence Name 

RTA00000178AF.n.l0.1 

80.E6.sp6:l 30261. Seq 

RTA00000178AF.n.24.1 

RTA00000178AF.p.l.l 

80.G6.sp6:130285.Seq 

172.D8.sp6:133920.Seq 

80.A7.sp6:130214.Seq 

RTAOOOOO 1 79AF.e.20. 1 

172.B9.sp6:133897.Seq 

80.B7.sp6:130226.Seq 

RTAOOOOO 179AFJ. 13.1 

RTAOOOOO 1 79AFJ. 1 3. 1 .Seq_THC 1 05720 

80.C7.sp6.T30238.Seq 

RTA00000179AF.k.20.1 

RTAOOOOO 179AF.n. 10.1 

80.D7.sp6:130250.Seq 

180.H5.sp6:136003.Seq 

RTAOOOOO 1 79 AF.o.22.1 

80.F7.sp6:130274.Seq 

RTAOOOOO 1 80 AF.a.9.1 

80.H7.sp6:130298.Seq 

80.A8.sp6:130215.Seq 

RTAOOOOO 1 80 AF.a. 11.1 

79. Cl.sp6:130040.Seq 
RTAOOOOO 118A.g.9.1 
RTA00000180AF.C.2.1 

80. D8.sp6: 130251. Seq 
RTA00000180AF.e.24.1 
80.H8.sp6:130299.Seq 
80.A9.sp6: 13021 6.Seq 
RTAOOOOO 1 80AF.f. 18.1 
RTA00000180AF.g.7.1 
80.B9.sp6:130228.Seq 
RTAOOOOO 1 80 AF.g.22.1 
80.C9.sp6:130240.Seq 
80.D9.sp6:130252.Seq 
RTA00000180AF.h.5.1 
100.G9.sp6:131502.Seq 
RTAOOOOO 180AF.h. 19.1 
80.E9.sp6:130264.Seq 
80.Bll.sp6:130230.Seq 
RTAOOOOO 118A.g. 14.1 
80.Cll.sp6:130242.Seq 
RTAOOOOO 118A.g. 16.1 
RTAOOOOO 1 81 AF.f.5.1 
80.Dll.sp6:130254.Seq 
100.Dll.sp6:131468.Seq 
RTAOOOOO 181 AF.f. 12.1 
80.Ell.sp6:130266.Seq 

RTAOOOOO 1 1 8A.j.2 1 . 1 .Seq_THC 151859 
RTAOOOOO 118AJ.2 1.1 
79.Fl.sp6:130076.Seq 
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Clone Name 

M00001450A:A02 

M00001450A:A11 

M00001450A:A11 

M00001450A:B12 

M00001450A:B12 

M00001450A:B12 

M00001450A:D08 

M00001450A:D08 

MOO00145OA:D08 

M00001452A:B04 

M00001452A:B04 

M00001452A:B12 

M00001452A:B12 

M00001452A:F05 

M00001452A:F05 

M00001452C:B06 

M00001452C:B06 

M00001452C:B06 

M00001453A:E11 

M00001453A-.E11 

M00001453A:E11 

M00001453C:F06 

M00001453C:F06 

M00001454A:A09 

M00001454A:A09 

M00001454B:C12 

M00001454B:C12 

M00001454B:C12 

M00001455B:E12 

M00001455B:E12 

M00001460A:F06 

M00001460A:F06 

M00001461A:D06 

M00001461A:D06 

M00001465A:B11 

M00001465A:BU 

M00001467A:B07 

M00001467A:B07 

M00001467A:D04 

M00001467A:D04 

M00001467A:E10 

M00001467A:E10 

M00001468A:F05 

M00001468A:F05 

M00001469A:A01 

M00001469A:A01 

M00001469A:C10 

M00001469A:C10 

M00001469A:H12 

M00001469A:H12 

M00001470A:C04 

M00001470A:C04 



Cluster ID 

39304 

32663 

32663 

82498 

82498 

82498 

27250 

27250 

27250 

84328 

84328 

86859 

86859 

85064 

85064 

16970 

16970 

16970 

16130 

16130 

16130 

16653 

16653 

83103 

83103 

7005 

7005 

7005 

13072 

13072 

2448 

2448 

1531 

1531 

10145 

10145 

38759 

38759 

39508 

39508 

39442 

39442 

7589 

7589 



12081 
12081 
19105 
19105 
39425 
39425 



Sequence Name 

180.G9.sp6:135995.Seq 

80.Fll.sp6:130278.Seq 

RTA00000118A.1.8.1 

100.Fll.sp6:131492.Seq 

RTA00000118A.m.l0.1 

79. Gl.sp6:130088.Seq 

80. Gll.sp6:130290.Seq 
180.B10.sp6:135936.Seq 
RTA00000181AF.g.l0.1 
RTA00000118A.p.l0.1 
79.A2.sp6:130017.Seq 
RTA00000118A.p.8.1 
79.B2.sp6:130029.Seq 
RTA00000131A.m.23.1 

79. D2.sp6:130053.Seq 

80. Hll.sp6:130302.Seq 
100.C12.sp6:131457.Seq 
RTA00000181AR.i.l8.2 
80.A12.sp6:130219.Seq 
100.D12.sp6:131469.Seq 
RTA00000119A.C.13.1 
80.B12.sp6:130231.Seq 
RTA00000181AF.k.5.3 
RTA00000119A.e.24.2 

79. G2.sp6:130089.Seq 
121.Dl.sp6:131917.Seq 
RTA00000181AF.k.24.1 

80. C12.sp6:130243.Seq 
80.F12.sp6:130279.Seq 
RTA00000 1 8 1 AR.m.5.2 
89.Al.sp6:130667.Seq 
RTA00000119AJ.21.1 
89.Cl.sp6: 130691. Seq 
RTA000001 19A.o.3.1 
79.F3.sp6:130078.Seq 
RTA00000120A.g.l2.1 
89.Fl.sp6:130727.Seq 
RTA00000120A.m.l2.3 
RTA00000120A.O.2.1 
89.Gl.sp6:130739.Seq 
89.A2.sp6:130668.Seq 
RTA00000120A.O.21.1 
RTA00000120A.p.23.1 
89.B2.sp6:130680.Seq 
RTA00000121A.C.10.1 
89.C2.sp6:130692.Seq 
89.D2.sp6:130704.Seq 
RTA00000133A.d.l4.2 
89.E2.sp6:130716.Seq 
RTA00000133A.e.l5.1 
89.G2.sp6:130740.Seq 
RTA00000133A.f.l.l 
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Clone Name 

M00001471A:B01 

M00001471A:B01 

M00001487B:H06 

M00001487B:H06 

M00001488B:F12 

M00001488B:F12 

M00001494D:F06 

M00001494D:F06 

M00001499B:A11 

M00001499B:A11 

M00001499B:A11 

M00001500A:C05 

M00001500A:C05 

MOO0015O4A:E01 

M00001504A:E01 

M00001504A:E01 

M00001504C:A07 

M00001504C:A07 

M00001505C:C05 

MOO001505C:C05 

M00001506D:A09 

M00001506D:A09 

M00001506D:A09 

M00001507A:H05 

M00001507A:H05 

M00001535A:F10 

M00001535A:F10 

M00001541A:H03 

M00001541A.H03 

M00001544A:G02 

M00001544A:G02 

M00001545A:D08 

M00001545A:D08 

M00001551A:F05 

M00001551A:F05 

M00001552A:D11 

M00001552A:D11 

M00001557A:F03 



Cluster ID 

39478 

39478 



7206 

7206 

10539 

10539 

10539 

5336 

5336 



10185 
10185 



39168 
39168 
39423 
39423 
39174 
39174 
19829 
19829 
13864 
13864 
39180 
39180 
39458 
39458 
39490 



Sequence Name 

89.H2.sp6:130752.Seq 

RTA00000133A.i.5.1 

RTA00000182AF.1.15.1 

89.B3.sp6:l 30681. Seq 

RTA00000182AF.1.20.1 

89.C3.sp6:130693.Seq 

RTA00000182AF.O.15.1 

89.E3.sp6:130717.Seq 

RTA00000183AF.a.24.1 

89.G3.sp6: 130741. Seq 

173.B5.SP6:134085.Seq 

RTA00000183AF.b.l3.1 

89.H3.sp6:130753.Seq 

RTA00000183AF.C.24.1 

89.D4.sp6:130706.Seq 

RTA00000183AF.c.24.1.Seq_THC125912 

RTA00000183AF.d.5.1 

89.E4.sp6:130718.Seq 

89.H4.sp6:130754.Seq 

RTA00000183AF.e.l.l 

89.A5.sp6: 130671. Seq 

RTA00000183AF.e.23.1 

121.G6.sp6:131958.Seq 

RTA00000121A.1.10.1 

89.B5.sp6:130683.Seq 

79.C5.sp6:130044.Seq 

RTA00000134A.k.22.1 

79JE5.sp6:130068.Seq 

RTA00000124A.n.l3.1 

79.H5.sp6:130104.Seq 

RTA00000125A.h.24.4 

RTA00000125A.m.9.1 

79.B6.sp6:130033.Seq 

RTA00000126A.n.8.2 

79.A7.sp6:130022.Seq 

RTA00000126A.p.l5.2 

79.D7.sp6:130058.Seq 

RTA00000128A.b.4.1 
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Clone Name 

M0000151 1A.H06 

M00001S11A:H06 

M00001512A:A09 

M00001512A:A09 

M00001512D:G09 

M00001512D:G09 

M00001512D:G09 

M0OO01513B:G03 

M00001513B:G03 

M00001513B:G03 

M00001513C:E08 

M00001513C:E08 

M00001514C:D1 1 

M00001514C:D11 

M00001514C:D11 

M00001518C:B11 

M00001518C:B11 

M00001528B:H04 

M00001528B:H04 

M00O01531A:D01 

M00001531A:D01 

M00001534A:C04 

M00001534A:C04 

M00001534A:D09 

M00001534A:D09 

M00001534C:A01 

M00001534C:A01 

M00001535A:C06 

M00001535A:C06 

M00001535A:C06 

M00001536A:B07 

M00001536A:B07 

M00001537A:F12 

M00001537A:F12 

M00001540A:D06 

M00001540A:D06 

M00001542A:E06 

M00001542A:E06 

M00001544A:E06 

M00001544A:E06 

M00001544A:E06 

M00001545A:B02 

M00001545A:B02 

M00001548A:E10 

M00001548A:E10 

M00001548A:E10 

M00001549C:E06 

M00001549C:E06 

M00001550A:A03 

M00001550A:A03 

M00001550A:G01 

M00001550A:G01 



Cluster ID 

39412 

39412 

39186 

39186 

3956 

3956 

3956 



14364 

14364 

40044 

40044 

40044 

8952 

8952 

8358 

8358 

38085 

38085 

16921 

16921 

5097 

5097 

4119 

4119 

20212 

20212 

20212 

2696 

2696 

39420 

39420 

8286 

8286 

39453 

39453 



5892 

5892 

5892 

16347 

16347 

7239 

7239 

5175 

5175 



Sequence Name 

RTA00000133A.k.l7.1 

89.C5.sp6:l 30695. Seq 

89.D5.sp6:130707.Seq 

RTA00000121A.p.l5.1 

89.E5.sp6:130719.Seq 

173.H5.SP6:134157.Seq 

RTA00000183AF.g.3.1 

RTA00000183AF.g.9.1 

89.F5.sp6: 130731. Seq 

RTA00000 1 83 AF.g.9. 1 .Seq_THC 1 98280 

RTA00000183AF.g.l2.1 

89.G5.sp6:130743.Seq 

RTA00000183AF.g.22.1 

RTA000001 83AF.g.22. 1 .Seq_THC232899 

89.H5.sp6:130755.Seq 

89.A6.sp6:130672.Seq 

RTA00000183AF.h.l5.1 

89.D6.sp6:130708.Seq 

RTAOOOOO 1 83 AF.i.5.1 

RTA00000123A.e.l5.1 

89.E6.sp6:130720.Seq 

RTAOOOOO 1 83 AF.k.6.1 

89.H6.sp6:130756.Seq 

RTA00000134A.k.l.l 

RTAOOOOO 1 34A.k. 1 . 1 .Seq_THC2 1 5869 

RTAOOOOO 183AF.k. 16.1 

89.C7.sp6:130697.Seq 

89.E7.sp6: 130721. Seq 

RTAOOOOO 1 34 A.1.22. 1 .Seq_THC 1 28232 

RTAOOOOO 134A.1. 22.1 

RTAOOOOO 134A.m. 13.1 

89.F7.sp6: 130733. Seq 

89.H7.sp6:130757.Seq 

RTAOOOOO 134A.o.23.1 

89.B8.sp6:130686.Seq 

RTAOOOOO 1 83 AF.o. 1.1 

89.E8.sp6:130722.Seq 

RTAOOOOO 1 35 A.g. 11.1 

RTAOOOOO 1 84 AF.a.8.1 

173.G7.SP6:134147.Seq 

89.H8.sp6:130758.Seq 

89B9.sp6:130687.Seq 

RTA00000135A.1.2.2 

89.E9.sp6:130723.Seq 

RTAOOOOO 1 84 AF.d. 11.1 

RTAOOOOO 1 84AF.d. 11.1 .Seq_THC 1 6 1 896 

89.H9.sp6:130759.Seq 

RTA00000184AF.e.l5.1 

89.A10.sp6:130676.Seq 

RTAOOOOO 126A.m.4.2 

RTAOOOOO 1 84 AF.f.3.1 

89.B10.sp6:130688.Seq 
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Clone Name 


Cluster ID 


M0000 155 1 A:G06 


22390 


X jf A AA A 1 C C 1 a f~i f\f 

M0O00 1 55 1 A:G06 


22390 


x A f\f\f\f\ 1 c c 1 /"■" /\ A 

MOUOO 1 55 1 C:G09 


3266 


\4AAAA1 C C 1 /*"» /^» A A 

M0000 1551 C:G09 


3266 


iv a a a a a i c a t tf\/z 

M0000 1 553 A:H06 


8298 


M0000 1 553 A:H06 


8298 


M00001553B:F12 


4573 


X4AAAA1 in i-« -t 

MOO0O1553B:F12 


4573 


K iAAAA 1 C C C A T^rt'*» 

M0O0O 1 555 A:B02 


39539 


M0000 1 555A:B02 


39539 


X A f\f\t\f\ 1 £" C /" A y>i /*k -1 

M00001555A:C01 


39195 


XiAAAAl CCC a y^irtt 

M00001555A:C0I 


39195 


M00001555D:G10 


4561 


M0O0O1555D:G10 


4561 


M00001556A:C09 


9244 


M00001556A:C09 


9244 


M00001556B:G02 


1 1294 


M0O0O1556B:G02 


11294 


M00001557B:H10 


5192 


MOO0O1557B:H10 


5192 


M00001557B:H10 


5192 


M00001557D:D09 


8761 


M00001557D:D09 


8761 


M00001558B:H1 1 


7514 


MOO0O1558B:Hl 1 


7514 


M00001559B:F01 




M00001559B:F01 




M00001560D:F10 


6558 


M00001560D:F10 


6558 


M00001566B:D1 1 




M00001566B:D1 1 




M0O0O1583D:A10 


6293 


X A A f\t\f\ 1 f o^r\ a t r\ 

M0000 1 583D: A 1 0 


6293 


M0000 1 590B :F03 




X 4AAAA 1 C f~\ f\ r» 1—' /"\ ~% 

M00001590B:F03 




MAAAA1 CA*7r\ />/\^ 

M0000 1 597D:C05 


10470 


M0000 1 597D:C05 


10470 


X yf AAAA i rno a 

M0000 1 598A:G03 


16999 


X yf A AAA 1 CAO A - /^i m 

MOOOU 1 598 A.G03 


16999 


XyfAAAAt^AI A f\ f> 

M0000 1 60 1 A:D08 


22794 


X/fAAAA 1 Z" A 1 A T*\AO 

M0000 1 601 A:D08 


22794 


X /f A A A A 1 /T r\ O a l~« i i 

M00001607A:E1 1 


1 1465 


XyfAAAAl ZTA"7 A j— t 1 

M00001607A:E1 1 


1 1465 


X/fAAAA 1 /CAO A -T>A*^ 

MUOOO 1 60oA:B03 


7802 


X/fAAAAl / AO A f\ "I 

MOO0O1608A:B03 


7802 


M00001608B:E03 


22155 


M00001608B:E03 


22155 


M00001608D:A11 




M00001608D:A11 




M00001614C:F10 


13157 


M00001614C:F10 


13157 


M00001617C:E02 


17004 



Sequence Name 

RTA00000136AJ.13.1 

89.C10.sp6.130700.Seq 

RTA00000184AR.g.l.l 

89.D10.sp6:130712.Seq 

RTA00000127A.d.l9.1 

89.G10.sp6:130748.Seq 

89.H10.sp6:130760.Seq 

RTA00000184AF.h.9.1 

RTA00000127A.L21.1 

89.Bll.sp6:130689.Seq 

89.Cll.sp6:130701.Seq 

RTA00000137A.C.16.1 

RTA00000184AF.L21.1 

89.Dll.sp6:130713.Seq 

89.Ell.sp6:130725.Seq 

RTA00000127A.1.3.1 

RTA00000184AFJ.6.1 

89.A12.sp6:130678.Seq 

173.E9.SP6:134125.Seq 

RTA00000184AF.k.2.1 

89.D12.sp6:130714.Seq 

RTA00000184AF.k.l2.1 

89.E12.sp6:130726.Seq 

RTA00000184AF.k.21.1 

89.G12.sp6:130750.Seq 

89. H12.sp6:130762.Seq 
RTA00000184AF.I.11.1 

90. Al.sp6:130859.Seq 
RTA00000184AF.m.21 . 1 
RTA00000184AF.p.3.1 
90.Dl.sp6:130895.Seq 
RTA00000185AF.e.ll.l 
90.A2.sp6:130860.Seq 
RTA00O00185AF.g.ll.l 
90.C2.sp6:130884.Seq 
RTA00000185AF.k.6.1 
90.F2.sp6:130920.Seq 
90.G2.sp6:130932.Seq 
RTA00000185AF.k.9.1 
RTA00000138A.b.5.1 
90.H2.sp6:130944.Seq 
RTA00000185AF.m.l9.1 
90.A3.sp6:130861.Seq 
RTA00000185AF.n.5.1 
90.B3.sp6:130873.Seq 
RTA00000185AF.n.9.1 
90.C3.sp6:130885.Seq 
RTA00000185AF.n.l2.1 
90.D3.sp6:130897.Seq 
RTA00000186AF.a.6.1 
90.E3.sp6:130909.Seq 
RTA00000186AF.b.21.1 
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Clone Name Cluster ID 

M00001617C:E02 17004 

M000016I9C:F12 40314 

M00001619C:F12 40314 

M00001621C:C08 40044 

M00001621C:C08 40044 

M00001621C:C08 40044 

M00001621C:C08 40044 

M00001623D:F10 13913 

M00001623D:F10 13913 
M00001632D:H07 
M00001632D:H07 
M00001632D:H07 
M00001632D:H07 

M00001644C:B07 39171 

M00001644C:B07 39171 

M00001644C:B07 39171 

M00001645A:C12 19267 

M00001645A:C12 19267 

M00001645A:C12 19267 

M00001645A:C12 19267 

M00001648C:A01 4665 

M00001648C:A01 4665 

M00001657D:C03 23201 

M00001657D:C03 23201 

M00001657D:F08 76760 

M00001657D:F08 76760 

M00001662C:A09 23218 

M00001662C:A09 23218 

M00001663A:E04 35702 

M00001663A:E04 35702 

M00001669B:F02 6468 

M00001669B:F02 6468 

M00001670C:H02 14367 

M00001670C:H02 14367 

M00001673C:H02 7015 

M00001673C:H02 7015 

M00001675A:C09 8773 

M00001675A:C09 8773 

M00001675A:C09 8773 

M00001676B:F05 11460 

M00001676B:F05 11460 

M00001676B:F05 11460 

M00001677D:A07 7570 

M00001677D:A07 7570 

M00001677D:A07 7570 

M00001678D:F12 4416 

M00001678D:F12 4416 

M00001679A:F10 26875 

M00001679A:F10 26875 

M00001679B:F01 6298 

M00001679B:F01 6298 

M00001680D:F08 10539 



PCT/US98/27610 



Sequence Name 

90.F3.sp6:130921.Seq 

90.G3.sp6:130933.Seq 

RTA00000186AF.C.15.1 

RTA00000186AF.d.l.l 

RTA000001 86AF.d. 1 . 1 .Seq_THC232899 

90.H3.sp6:130945.Seq 

122.El.sp6:132121.Seq 

RTA00000186AF.e.6.1 

90.A4.sp6:130862.Seq 

RTA00000 1 86AF.h. 14.1 .Seq_THC 112525 

RTA00000186AF.h.l4.1 

90.E4.sp6:130910.Seq 

176.A3.sp6:134514.Seq 

RTA00000186AF.1.7.1 

90.F4.sp6:130922.Seq 

217.A12.sp6:139369.Seq 

RTA000001 86AF.1. 12. 1 .Seq_THCI 781 83 

176.G3.sp6:134586.Seq 

RTA00000186AF.1.12.I 

90.G4.sp6:130934.Seq 

90.H4.sp6:130946.Seq 

RTA00000 1 86 AF.m .3 . 1 

RTA00000187AF.a.l4.1 

90.B5.sp6:130875.Seq 

90.C5.sp6:130887.Seq 

RTA00000187AF.a.l5.1 

RTA00000187AR.C.5.2 

90.D5.sp6:130899.Seq 

90.E5.sp6:130911.Seq 

RTA000001 87AR.C. 1 5.2 

90.F5.sp6:130923.Seq 

RTA00000187AF.d.l5.1 

90.G5.sp6:130935.Seq 

RTA00000187AF.e.8.1 

90.H5.sp6:130947.Seq 

RTA0O000187AF.f.l8.1 

RTA00000187AF.f.24.1 

90.A6.sp6:130864.Seq 

RTA000001 87AF.f.24. 1 .Seq_THC220002 

RTA00000187AF.g.l2.1 

90.B6.sp6:130876.Seq 

219.F2.sp6:139035.Seq 

90.D6.sp6:130900.Seq 

RTA00000 1 87 AF.g.24. 1 

RTA000001 87AF.g.24. 1 .Seq_THC 1 68636 

90.E6.sp6:130912.Seq 

RTA00000187AF.h.l3.1 

RTA00000187AF.i.l.l 

90.A7.sp6:130865.Seq 

90.B7.sp6:130877.Seq 

RTA00000 1 8 7 AR. i . 1 0.2 

90.F7.sp6:130925.Seq 

133 



BNSDOCID: <WO 9933982 A2_l_> 



WO 99/33982 
cDNA Library ES 14 - ATCC# 
Deposit Date - December 22, 1 998 



PCT/US98/27610 



Clone Name 

M00001680D:F08 

M00001680D:F08 

M00001682C:B12 

M00001682C:B12 

M00001682C:B12 

M00001688C:F09 

M00001688C:F09 

M00001693C:G01 

M00001693C:G01 

M00001716D.-H05 

M00001716D:H05 

M00003741D:C09 

M00003741D:C09 

M00003747D:C05 

M00003747D:C05 

M00003747D:C05 

M00003747D:C05 

M00003754C:E09 

M00003754C:E09 

M0O0O3761D:A09 

M00003761D:A09 

M00003761D:A09 

M00003762C:B08 

M00003762C:B08 

M00003762C:B08 

M00003763A:F06 

M00003763A:F06 

M00003774C:A03 

M00003774C:A03 

M00003774C:A03 

M00003784D:D12 

MO00O3784D:D12 

M00003839A:D08 

M00003839A:D08 

M00003851B:D08 

M00003851B:D08 

M00003851B:D10 

M00003851B:D10 

M000O3853A:D04 

M00003853A:D04 

M00003853A:F12 

M00003853A:F12 

M00003856B:C02 

M00003856B:C02 

M00003857A:H03 

MO00O3857A:HO3 

MO0OO3857A:H03 



Cluster ID 

10539 

10539 

17055 

17055 

17055 

5382 

5382 

4393 

4393 

67252 

67252 

40108 

40108 

11476 

11476 

11476 

11476 



17076 

17076 

17076 

3108 

3108 

67907 

67907 

67907 



7798 
7798 



13595 

13595 

5619 

5619 

10515 

10515 

4622 

4622 

4718 

4718 

4718 



Sequence Name 

219.F6.sp6:139039.Seq 

RTA00000187AF.I.7.1 

90.G7.sp6:130937.Seq 

RTA00000187AF.m.3.1 

176.D6.sp6:134553.Seq 

90.A8.sp6:130866.Seq 

RTA00000187AF.m.23.2 

RTA00000187AF.n.l7.1 

90.B8.sp6:130878.Seq 

RTA00000187AF.O.6.1 

90.C8.sp6:130890.Seq 

90.D8.sp6:130902.Seq 

RTA00000187AF.O.24.1 

RTA00000187AF.p.l9.1 

90.E8.sp6:130914.Seq 

RTA000001 87AF.p. 19.1 .Seq_THC 1 08482 

219.H8.sp6:139065.Seq 

90.F8.sp6:130926.Seq 

RTA00000188AF.b.l2.1 

RTA00000188AF.d.ll.l 

90.H8.sp6:130950.Seq 

RTA00000 1 88AF.d. 11.1 .Seq_THC2 1 2094 

RTA00000188AF.d.21 .1 .Seq_THC208760 

90.A9.sp6:130867.Seq 

RTA00000188AF.d.21.1 

RTA00000188AF.d.24.1 

90.B9.sp6:130879.Seq 

RTAOOOOO 1 88AF.g. 11.1 .Seq_THC 1 23222 

RTA00000188AF.g.ll.l 

90.C9.sp6:130891.Seq 

RTA00000188AF.i.8.1 

90.D9.sp6:130903.Seq 

RTA00000189AF.C.18.1 

90.A10.sp6:130868.Seq 

90.D10.sp6:130904.Seq 

RTAOOOOO 189AF.f.7.1 

90.E10.sp6:130916.Seq 

RTA00000189AF.f.8.1 

90.F10.sp6:130928.Seq 

RTAOOOOO 1 89AF.f. 17.1 

90.G10.sp6:130940.Seq 

RTAOOOOO 189AF.f. 18.1 

90.H10.sp6:130952.Seq 

RTAOOOOO 189AF.g. 1.1 

90.Bll.sp6:130881.Seq 

RTAOOOOO 1 89AF.g.5 . 1 .Seq_THC 1 96 1 02 

RTA00000189AF.g.5.1 
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Clone Name 

M00003867A:D10 

M00003867A:D10 

M00003871C:E02 

M00003875C:G07 

M00003875C:G07 

M00003875D:D1 1 

M00003875D:D1 1 

M00003876D.-E12 

M00003876D:E12 

M00003906C:E10 

M00003906C:E10 

M00003907D:A09 

M00003907D:A09 

M00003907D:A09 

M00003907D:H04 

M00003907D:H04 

M00003909D:C03 

M00003909D:C03 

M00003968B:F06 

M00003968B:F06 

M00003970C:B09 

M00003970C:B09 

M00003970C:B09 

M00003974D:E07 

M00003974D:E07 

M00003974D:E07 

M00003974D:H02 

M00003974D:H02 

M00003974D:H02 

M00003981A:E10 

M00003981A:E10 

M00003982C:C02 

M00003982C:C02 

M00003982C:C02 

M00004028D:C05 

M00004028D:C05 

M00004035C:A07 

M00004035C:A07 

M00004035D:B06 

M00004035D:B06 

M00004072A:C03 

M00004072A:C03 

M00004081C:D10 

M00004081C:D10 

M00004086D:G06 

M00004086D:G06 

M00004105C:A04 

M00004105C:A04 

M00004171D:B03 

M00004171D:B03 

M00004185C:C03 

M00004185C-.C03 



Cluster ID 



4573 
8479 
8479 



7798 

7798 

9285 

9285 

39809 

39809 

39809 

16317 

16317 

8672 

8672 

24488 

24488 

40122 

40122 

40122 

23210 

23210 

23210 

23358 

23358 

23358 

3430 

3430 

2433 

2433 

2433 

40073 

40073 

37285 

37285 

17036 

17036 



15069 

15069 

9285 

9285 

7221 

7221 

4908 

4908 

11443 

11443 



Sequence Name 

90.C1 l.sp6:l 30893 .Seq 

RTA00000189AF.h.l7.1 

RTA00000189AF.j.l2.1 

90.Gll.sp6:130941.Seq 

RTA00000189AF.j.22.1 

90.Hll.sp6: 130953. Seq 

RTA00000189AF.j.23.1 

90A12.sp6:130870.Seq 

RTA00000189AF.k.l2.1 

90.H12.sp6:130954.Seq 

RTA00000190AF.d.7.1 

99Al.sp6:131230.Seq 

RTA000001 90AF.e.3. 1 .Seq_THCl 502 1 7 

RTA00000190AF.e.3.1 

99.Bl.sp6:131242.Seq 

RTA00000190AF.e.6.1 

RTA00000190AF.f.ll.l 

99.Cl.sp6:131254.Seq 

RTA00000190AF.n.l6.1 

99.C2.sp6:131255.Seq 

RTA00000190AF.n.23.1 

RTA000001 90AF.n.23 . 1 .Seq_THC 1 09227 

99.D2.sp6:131267.Seq 

RTA00000190AF.O.20.1 

RTA00000190AF.o.20.1.Seq_THC207240 

99.E2.sp6:131279.Seq 

RTA000001 90AF.O.2 1 . 1 .Seq_THC207240 

RTA00000190AF.O.21.1 

99.F2.sp6: 13 1291. Seq 

99.A3.sp6:131232.Seq 

RTA00000191AF.a.9.1 

RTA00000191AF.a.l5.2 

99.B3.sp6:131244.Seq 

RTA000001 91 AF.a. 1 5.2.Seq_THC79498 

RTA00000191AF.e.3.1 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

RTA00000191AF.f.ll.l 

RTA00000191AF.f.l3.1 

99.A4.sp6: 13 1233. Seq 

RTA00000191AF.j.9.1 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

RTA000001 91 AF. 1.6.1 

99.H4.sp6:131317.Seq 

RTA00000191AF.m.l8.1 

99.D5.sp6:131270.Seq 

RTA00000191AF.p.9.1 

RTA00000192AF.J.2.1 

99.F6.sp6:131295.Seq 

RTA00000192AF.1.13.2 

123.A8.sp6:132272.Seq 
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Clone Name 

M00004185C:C03 

M00004191D:B1 1 

M00004191D:B11 

M00004191D:B1 1 

M00004197D:H01 

M00004197D:H01 

M00004197D:H01 

M00004203B:C12 

M00004203B:C12 

M00004214C:H05 

M00004214C:H05 

M00004223D:E04 

M00004223D:E04 

M00004269D:D06 

M00004269D:D06 

M00004295D:F12 

M00004295D:F12 

M00004296C:H07 

M00004296C:H07 

M00004307C:A06 

M00004307C:A06 

M00004307C:A06 

M00004312A:G03 

M00004312A:G03 

M00004312A:G03 

M00004318C:D10 

M00004318C:D10 

M00004359B:G02 

M00004359B:G02 

MOOOO4505D:F08 

M00004505D:F08 

M00004692A:H08 

M00004692A:H08 

M00004692A:H08 

M00005180C:G03 



Cluster ID 
11443 



8210 

8210 

8210 

14311 

14311 

11451 

11451 

12971 

12971 

4905 

4905 

16921 

16921 

13046 

13046 

9457 

9457 

9457 

26295 

26295 

26295 

21847 

21847 



Sequence Name 

99A7.sp6:131236.Seq 

RTA00000192AF.m.l2.1 

99.B7.sp6:131248.Seq 

123.C8.sp6:132296.Seq 

99G7.sp6:131260.Seq 

123.E8.sp6:132320.Seq 

RTA00000192AF.n.l3.1 

99.D7.sp6:131272.Seq 

RTA00000192AF.O.2.1 

177.D8.sp6:134747.Seq 

RTA00000192AF.p.l7.1 

RTA00000193AF.a.20.1 

99.B8.sp6:131249.Seq 

99.H8.sp6:131321.Seq 

RTA00000193AF.e.l4.1 

99.D9.sp6:131274.Seq 

RTA00000193AF.h.l5.1 

99.E9.sp6:131286.Seq 

RTA00000193AF.h.l9.1 

RTA00000193AF.i.l4.2 

99.F9.sp6:131298.Seq 

123.Dll.sp6:132311.Seq 

RTA00000193AF.i.24.2 

99.G9.sp6:131310.Seq 

RTA000001 93 AF.i.24.2.Seq_THC 1 97345 

RTA00000193AF.j.9.1 

99.H9.sp6:131322.Seq 

RTA00000193 AF.m.5. 1 .Seq_THC 1 733 1 8 

RTA00000193AF.m.5.1 

RTA00000194AF.b.l9.1 

99.H10.sp6: 131 323. Seq 

99.Bll.sp6:131252.Seq 

RTA00000 1 94AF.C.24. 1 

377.F4.sp6:141957.Seq 

RTA00000194AF.f.4.1 
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Clone Name 

M00001346D:E03 

M00001350A:B08 

M00001350A:B08 

M00001357D:D11 

M00001357D:D11 

M00001409C:D12 

M00001409C:D12 

M00001418B:F03 

M00001418B:F03 

M00001418B:F03 

M00001418D:B06 

M00001421C:F01 

M00001421C:F01 

M00001429B:A11 

M00001432C:F06 

M00001439C:F08 

M00001442C:D07 

M00001442C:D07 

M00001443B:F01 

M00001443B:F01 

M00001445A:F05 

M00001445A:F05 

M00001446A:F05 

M00001455A:E09 

M00001455A:E09 

M00001460A:F12 

MO0O01481D:AO5 

M00001490B:C04 

M00001490B:C04 

M00001500C:E04 

M00001500C:E04 

M00001532B:A06 

M00001532B:A06 

M00001534A:F09 

M00001534A:F09 

M00001535A:B01 

M00001536A:C08 

M00001536A:C08 

M00001541A:F07 

M00001542B:B01 

M00001542B:B01 

M00001544A:E03 

M00001545A:CO3 

M00001545A:C03 

M00001545A:C03 

M00001548A:H09 

M00001548A:H09 

M00001548A:H09 

M00001549A:B02 

M00001549A:B02 

MO0001549A:DO8 

M00001552B:D04 



Cluster ID Sequence Name 

6806 RTA00000177AF.g.l3.3 

80.H2.sp6:130293.Seq 

RTA00000177AF.i.6.2 

4059 RTA000001 77AF.n. 1 8.3.Seq_THC 12305 1 

4059 RTA00000177AF.n.l8.3 

9577 RTA00000179AF.O.17.1 

9577 80.E7.sp6:130262.Seq 

9952 RTA00000180AF.C.20.1 

9952 RTA000001 80AF.C.20. 1 .Seq_THC 1 62284 

9952 80.E8.sp6:130263.Seq 

8526 RTA00000180AF.d.l.l 

9577 RTA00000180AF.d.23.1 

9577 80.G8.sp6:130287.Seq 

4635 RTA0OO0OI8OAF.i.20.1 

RTA00000180AF.k.24.1 

40054 RTA00000180AF.p.l0.1 

16731 RTA00000181AF.a.20.1 

16731 80.C10.sp6:130241.Seq 

80.D10.sp6:130253.Seq 

RTA00000181AF.b.7.1 

13532 80.E10.sp6:130265.Seq 

13532 RTA00000181AF.C.4.1 

7801 RTA00000181AF.C.21.1 

13238 RTA00000181AF.m.4.1 

13238 RTAOOOOO 1 8 1 AF.m.4. 1 .Seq_THC 1 4069 1 

39498 RTAOOOOO 119A.j.20.1 

7985 RTAOOOOO 1 82 AR.j. 2.1 

18699 RTAOOOOO 182AF.m. 16.1 

18699 89.D3.sp6:l 30705 .Seq 

9443 89.B4.sp6:130682.Seq 

9443 RTA00000183AF.C.1.1 

3990 89.G6.sp6:130744.Seq 

3990 RTAOOOOO 1 83 AF.j. 11.1 

5321 89.B7.sp6:130685.Seq 

5321 RTAOOOOO 1 83 AF.k.8.1 

7665 RTAOOOOO 134A.1. 19.1 

39392 89.G7.sp6:130745.Seq 

39392 RTAOOOOO 134A.m. 16.1 

22085 RTA00000135A.e.5.2 

RTAOOOOO 1 83 AF.p.4.1 

89.F8.sp6:130734.Seq 

12170 RTAOOOOO 125A.h. 18.4 

19255 RTA00000135A.m.l8.1 

19255 184.B10.sp6:135547.Seq 

19255 89.C9.sp6:130699.Seq 

1 058 RTAOOOOO 1 26A.e.20.3 .Seq_THC2 1 7534 

1058 RTAOOOOO 126A.e.20.3 

105C 79.F6.sp6: 13008 1. Seq 

4015 RTAOOOOO 136A.e. 12.1 

4015 79.G6.sp6:130093.Seq 

10944 RTAOOOOO 126A.h. 17.2 

5708 RTAOOOOO 184AF.g. 12.1 
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Clone Name 
M00001552B:D04 
M00001552D:A01 
M00001552D:A01 
M00001553D:D10 
M00001553D:D10 
M00001558A:H05 
M00001558A:H05 
M00O01561A:C05 
M00001561A:C05 
M00001564A:B12 
M00001578B:E04 
M00001579D:C03 
M00001579D:C03 
M00001579D:C03 
M00001582D:F05 
M00001587A:B11 
M00001587A:B11 
M00001604A:F05 
M00001604A:F05 
M00001624A:B06 
M00001624A:B06 
M00001624A:B06 
M00001630B:H09 
M00001630B:H09 
M00001630B:H09 
M00001651A:H01 
M00001651A:H01 
M00001677C:E10 
M00001679C:F01 
M00001679C:F01 
M00001679C:F01 
M00001686A:E06 
M00003796C:D05 
M00003796C:D05 
M00003826B:A06 
M00003826B:A06 
M0OOO3833A:E05 
M00O03837D:A01 
M00003837D:A01 
M00003846B:D06 
M00003846B:D06 
M00003879B:D10 
M00003879B:D10 
M00003879D:A02 
M00003879D:A02 
M00003891C:H09 
M00003891C:H09 
M00003912B:D01 
M00003912B:D01 
M00004072B:B05 
M00004081C:D12 
M00004111D:A08 



Cluster ID 
5708 



22814 
22814 



39486 

39486 

5053 

23001 

6539 

6539 

6539 

39380 

39380 

39391 

39391 

3277 

3277 

3277 

5214 

5214 

5214 



14627 

78091 

78091 

78091 

4622 

5619 

5619 

11350 

11350 

21877 

7899 

7899 

6874 

6874 

31587 

31587 

14507 

14507 



12532 
12532 
17036 
14391 
6874 



Sequence Name 

89.E10.sp6:130724.Seq 

89.F10.sp6:130736.Seq 

RTA00000184AF.g.22.1 

RTA00000184AF.h.l4.1 

89All.sp6:130677.Seq 

RTA00000128A.C.20.1 

89. F12.sp6:130738.Seq 
RTA00000128A.m.22.2 
79.B8.sp6:130035.Seq 
RTA00000184AF.O.12.1 
RTA00000185AF.C.24.1 

90. Gl.sp6:130931.Seq 
173.A12.SP6:134080.Seq 
RTA00000185AF.d.ll.l 
RTA00000185AF.d.24.1 
RTA00000129A.e.24.1 
79.E8.sp6:130071.Seq 
RTA00000138A.C.3.1 
79A9.sp6:130024.Seq 
RTA00000138A.1.5.1 
217.El.sp6:139406.Seq 
90.B4.sp6:130874.Seq 
90.D4.sp6:130898.Seq 
122.C2.sp6.132098.Seq 
RTA00000186AF.g.ll.l 
RTA00000186AF.n.7.1 
90A5.sp6:130863.Seq 
RTA00000187AF.g.23.1 
90.C7.sp6:130889.Seq 
RTA00000187AF.j.6.1 
176.G5.sp6:134588.Seq 
RTA00000 1 87 AF.m. 1 5 .2 
RTA000001 88AF.1.9. 1 .Seq_THC 1 67845 
RTA00000188AF.1.9.1 
RTA00000189AF.a.24.2 
9O.F9.sp6:130927.Seq 
RTA00000189AF.D.21.1 
90.H9.sp6:130951.Seq 
RTA00000189AF.C.10.1 
RTA00000189AF.e.9.1 
90.C10.sp6:130892.Seq 
RTA00000189AF.I.20.1 
90.C12.sp6:130894.Seq 
90.D12.sp6:130906.Seq 
RTA00000189AR. 1.23.2 
90.G12.sp6:130942.Seq 
RTA00000189AF.p.8.1 
99.Dl.sp6:131266.Seq 
RTA00000190AF.g.2.1 
RTA00000191AF.j.l0.1 
RTA00000191AF.1.7.1 
RTA00000192AF.a.l4.1 
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cDNA Library ESI 6 - ATCC# 
Deposit Date - December 22, 1998 
Clone Name Cluster ID 

M00004111D:A08 6874 
M00004121B:G01 
M00004121B:G01 
M00004121B:G01 
M00004138B:H02 13272 
M00004138B:H02 13272 
M00004151D:B08 16977 
M00004169C:C12 5319 
M00004169C:C12 5319 
M00004169C:C12 5319 
M00004183C:D07 16392 
M00004183C:D07 16392 
M00004230B:C07 7212 
M00004230B:C07 7212 
M00004249D:F10 
M00004249D:F10 
M00004275C:C1 1 16914 
M00004275C:C1 1 16914 
M00004283B:A04 14286 
M00004285B:E08 56020 
M00004327B:H04 
M00004377C:F05 2102 
M00004384C:D02 
M00004384C:D02 
M00004461A:B08 
M00004461A:B09 
M00004691D:A05 
M00004896A:C07 



PCT/US98/27610 



Sequence Name 

99.F5.sp6:131294.Seq 

177.H4.sp6:134791.Seq 

99.H5.sp6:131318.Seq 

RTA00000192AF.C.2.1 

99.A6.sp6:131235.Seq 

RTA00000192AF.e.3.1 

RTA00000192AF.g.3.1 

99.E6.sp6: 131 283. Seq 

RTAOOOOO 192AF.i. 12.1 

1 23 .F7.sp6: 132331. Seq 

RTA00000192AF.1.1.1 

RTA00000192AF.1.1.1. Seq_THC20207 1 

RTA00000193AF.b.l4.1 

99.D8.sp6:131273.Seq 

RTAOOOOO 1 93 AF.c .2 1 . 1 .Seq_THC222602 

RTA00000193AF.C.21.1 

99.A9.sp6:131238.Seq 

RTA00000193AF.f.5.1 

RTAOOOOO 1 93 AF.f.22.1 

RTAOOOOO 1 93 AF.g.2.1 

RTA00000193AF.j.20.1 

RTAOOOOO 1 93 AF.n.7.1 

RTAOOOOO 193AF.n. 15.1 

RTAOOOOO 1 93 AF.n. 15.1 .Seq_THC2 1 5687 

RTAOOOOO 1 94 AR.a. 1 0.2 

RTAOOOOO 1 94 AF.a. 11.1 

RTAOOOOO 194AF.C.23.1 

RTAOOOOO 194AF.d. 13.1 



The above material has been deposited with the American Type Culture Collection, 
Rockville, Maryland, under the accession number indicated. This deposit will be maintained 
5 under the terms of the Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for purposes of Patent Procedure. The deposit will be maintained for a 
period of 30 years following issuance of this patent, or for the enforceable life of the patent, 
whichever is greater. Upon issuance of the patent, the deposit will be available to the public 
from the ATCC without restriction. 
10 This deposit is provided merely as convenience to those of skill in the art, and is not 

an admission that a deposit is required under 35 U.S.C. §112. The sequence of the 
polynucleotides contained within the deposited material, as well as the amino acid sequence 
of the polypeptides encoded thereby, are incorporated herein by reference and are controlling 
in the event of any conflict with the written description of sequences herein. A license may 
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be required to make, use, or sell the deposited material, and no such license is granted 
hereby. 

Retrieval of Individual Clones from Deposit of Pooled Clones 



prepared by first transfecting each of the clones into separate bacterial cells. The clones 
were then deposited as a pool of equal mixtures in the composite deposit. Particular clones 
can be obtained from the composite deposit using methods well known in the art. For 
example, a bacterial cell containing a particular clone can be identified by isolating single 

10 colonies, and identifying colonies containing the specific clone through standard colony 
hybridization techniques, using an oligonucleotide probe or probes designed to specifically 
hybridize to a sequence of the clone insert (e.g., a probe based upon unmasked sequence of 
the encoded polynucleotide having the indicated SEQ ID NO). The probe should be 
designed to have a T m of approximately 80°C (assuming 2°C for each A or T and 4°C for 

15 each G or C). Positive colonies can then be picked, grown in culture, and the recombinant 
clone isolated. Alternatively, probes designed in this manner can be used to PCR to isolate a 
nucleic acid molecule from the pooled clones according to methods well known in the art, 
e.g., by purifying the cDNA from the deposited culture pool, and using the probes in PCR 
reactions to produce an amplified product having the corresponding desired polynucleotide 

20 sequence. 



5 



Where the ATCC deposit is composed of a pool of cDNA clones, the deposit was 
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Table 1. Sequence identification numbers, cluster ID, sequence name, and clone name 



SEQIDNO: Cluster ID 


Sequence Name 


Clone Name 


1 


4635 


RTA00000 1 80AF.i.20. 1 


M00001429B:A11 


2 




RTA00000 1 85 AF.n. 12.1 


M00001608D:A11 


3 


4622 


RTA00000 1 87AF.m. 1 5 .2 


M00001686A:E06 


4 


3706 


RTAOOOOO 1 9 1 AF.i. 1 7.2 


M00004068B:A01 


5 


36535 


RTA00000 1 8 1 AF f 5 1 


M00001449A:G10 


6 


3990 


RTAOOOOO 1 83 AF.j . 1 1 . 1 


M00001532B:A06 


7 


5319 


RTAOOOOO 1 92 AF.i. 12. 1 


M00004169C:C12 


8 


36393 


RTAOOOOO 180AF.C.2.1 


M00001417A:E02 


9 


2623 


RTAOOOO0 1 83 AF a 6 1 


M00001497A:G02 


10 


7587 


RTAOOOOO 1 78 AF n 24 1 


M00001387B:G03 


1 1 


7065 


RTAOOOOO 1 3 7 A e 6 1 


M00001557A:D02 


1 2 

1 JL. 


10539 


RTAOOOOO 1 87 AF I 7 1 


M00001680D:F08 


1 J 


27250 


RTAOOOOOI 81 AF e 10 1 


M00001450A:D08 


14 

I T 


5556 


RTAOOOOOI 79AF n 10 1 


M00001407B:D1 1 






RTAOOOOOI 92 AF m 12 1 


M00004191D:B1 1 


16 


8761 


RTAOOOOOI 84 AF k 12 1 

IV I rVuuvvv 1 O^Al .IV. li.i 


M00001557D:D09 


17 


4622 


RTAOOOOOI 89 AF e 1 1 

IV I /Y V/ V/ V7 V/V7 1 O 7ill 1.1 


M00003856B:C02 


18 

I o 


1 1460 


RTAOOO0O1 87AF e 12 1 


M00001676B:F05 


19 


16283 


RTAOOOOO 120A o 20 1 


M00001467A:D08 


20 


3430 


RTAOOOOO 191 AF a 9 1 

IV I / vv vv/vv 171 z».i -d. y - i 


M00003981A:E10 


21 


7065 


RTAOOOOO 1 84 AF i 21 1 


M00001557A:D02 


22 




RTAOOOOOI 82 AF 1 20 1 


M00001488B:F12 


23 




RTAOOOOO 123 A e 19 1 


M0O0O1531A:Hll 


24 


16918 


RTAOOOOO 1 93 AF a 16 1 


M00004223A:G10 


25 


16914 


RTAOOOOO 1 93 AF f 5 1 


M00004275C:C1 1 


26 


40108 


RTAOOOOOI 87 AF.o.24. 1 


M00003741D:C09 


27 


14286 


RTAOOOOOI 93 AF.f.22.1 


M00004283B:A04 


28 


17004 


RTAOOOOO 1 86AF.b.2 1 . 1 


M00001617C:E02 


29 




RTAOOOOO 1 80AF.g.22. 1 


M00001426B:D12 


30 


13272 


RTAOOOOO 1 92 AF.e.3 . 1 


M00004138B:H02 


31 




RTAOOOOOI 94 AF.f.4.1 


M0OOO518OC:G03 


32 


32663 


RTAOOOOOI 18A.1. 8.1 


M00001450A:A11 


33 




RTAOOOOO 1 80AF.a.9. 1 


M00001414A:B01 


34 


5832 


RTAOOOOO 1 78 AF o 23 1 

IV I rVVUl/V/v 1 / Dill .v/.X. ~f . 1 


M00001388D:G05 


35 


7801 


RTAOOOOOI 8 lAF.c. 2 1.1 


M00001446A:F05 


36 


76760 


RTAOOOOOI 87AF.a. 15.1 


M00001657D:F08 


37 


40132 


RTAOOOOOI 78AF.C.7.1 


M00001365C:C10 


38 




RTAOOOOOI 83 AF.e. 1.1 


M00001505C:C05 


39 


4016 


RTAOOOOOI 18A.C.4.1 


M00001395A:C03 


40 


5382 


RTAOOOOO 187AF.m.23 .2 


M00001688C:F09 
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SEQ ID NO: 


Cluster ID 


Sequence Name 


41 


5693 


RTA00000 1 90AF.p. 1 7.2 


42 


307 


RTAOOOOO 136A.o.4.2 


43 


39833 


RTA00OOO 1 78AF.i.23. 1 


44 




RTA0O0OO 1 93 AF.m.5. 1 


45 


5325 


RTA00000191AF.O.6.1 


46 


5325 


RTA000OO 1 9 1 AF .o.6.2 


47 


18957 


RTA00000 1 90AR.m.9. 1 


48 


39508 


RTAOOOOO 1 20 A.o.2.1 


49 


22390 


RTA00000136AJ.13.1 


50 


12170 


RTA00000 1 25 A.h. 1 8.4 


51 


4393 


RTA00000 1 87AF.n. 1 7. 1 


52 


19 


RTAOOOOO 1 82AF.b.7. 1 


53 




RTA00000 1 93 AFx.2 1 . 1 


54 


7899 


RTAOOOOO 1 89AF.C. 1 0. 1 


55 


40073 


RTA00000191AF.e.3.1 


56 


7005 


RTAOOOOO 179AF.o.22.1 


57 




RTAOOOOO 187AF.h.22.1 


58 


18957 


RTAOOOOO 190AF.m.9.2 


59 


18957 


RTAOOOOO 1 83 AF.h.23.1 


60 


16283 


RTAOOOOO 1 82AF.C.22. 1 


61 


6974 


RTAOOOOO 1 83 AF.d.9.1 


62 


2623 


RTAOOOOO 183AF.b. 14.1 


63 


9105 


RTA00000191AF.a.21.2 


64 


13238 


RTA00000181AF.m.4.1 


65 


5749 


RTAOOOOO 185AF.a. 19.1 


66 


6455 


RTAOOOOO 1 93 AF.b.9.1 


67 


23001 


RTAOOOOO 1 85 AF.c.24.1 


68 


6455 


RTAOOOOO 1 92 AF.g.23 . 1 


69 


13595 


RTAOOOOO 1 89AF.f.8. 1 


70 


39442 


RTAOOOOO 120A.O.2 1.1 


71 


17036 


RTAOOOOO 1 9 1 AF.f. 1 3 . 1 


72 




RTAOOOOO 1 83 AF.g.9.1 


73 


7005 


RTAOOOOO 1 8 1 AF.k.24. 1 


74 


6268 


RTAOOOOO 1 26 A.o.23 . 1 


75 


16130 


RTAOOOOO 1 19A.C.13.1 


76 


23201 


RTAOOOOO 1 87AF.a. 1 4. 1 


77 


5321 


RTAOOOOO 1 83 AF.k.8.1 


78 


13157 


RTAOOOOO 1 86 AF.a.6. 1 


79 


2102 


RTAOOOOO 1 93 AF.n.7.1 


80 


1058 


RTAOOOOO 1 26 A.e.20.3 


81 


40392 


RTAOOOOO 180AF.j.8.1 


82 




RTAOOOOO 1 83 AF.e.23.1 


83 


11476 


RTAOOOOO 187AF.p. 19.1 
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Clone Name 

M00003978B:G05 

M00001552A:B12 

M00001378B:B02 

M00004359B:G02 

M00004093D:B12 

M00004093D:B12 

M00O03958A:H02 

M00001467A:D04 

M00001551A:G06 

M0000]544A:E03 

M00001693C:G01 

M00001463C:B11 

M00004249D:F10 

M00003837D:A01 

M00004028D:C05 

M00001410A:D07 

M00001679A:F06 

M00003958A:H02 

M00001528A:F09 

M00001467A:D08 

M00001504C:H06 

M00001500A:E11 

M00003983A:A05 

M00001455A:E09 

M00001571C:H06 

M00004229B:F08 

M00OO1578B:E04 

M00004157C:A09 

M00003851B:D10 

M00001467A:E10 

M00004035D:B06 

M00OO1513B:G03 

M00001454B:C12 

M00001551A:B10 

M00001453A:E11 

M00001657D:C03 

M00001534A:F09 

M00001614C:F10 

M00004377C:F05 

M00001548A:H09 

M00001429D:D07 

M00001506D:A09 

M00003747D:C05 
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SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


84 


3584 


RTA00000 1 77AF.h.20. 1 


M00001349B:B08 


85 


10470 


RTA0OO00 1 80AF.f. 1 8. 1 


M00001424B:G09 


86 


39425 


RTA00000133A.f.l.l 


M00001470A:C04 


87 


5175 


RTA00000 1 84AF.f.3. 1 


» 1AAAA t C C d\ A f\ 1 

M00001550A:G01 


88 


13576 


RTA00000 1 89AF.0. 13.1 


» / AArtA^ oof r> a i\ 

M00003885C:A02 


89 


7665 


RTA00000134A.1.19.1 


M000O1535A:BOl 


90 


16927 


RTA000001 77AF.h.9.3 


M00001 348B:B04 


91 


6660 


RTA00000 1 87 AF.h. 15.1 


M00001 679A:A06 


92 


2433 


RTAOOOOO 1 9 1 AF.a. 1 5.2 


M00003982C:C02 


93 


5097 


RTA00000134A.k.l.l 


» * AAAA 1 /~ "I A A T~~\ C\ f\ 

MOOOOl 534A:D09 


94 


21847 


RTA00000193AF.j.9.1 


M000043 1 8C:D1 0 


95 


3277 


RTAOOOOO 138A.1.5.1 


MOOOOl 624A:B06 


96 


5708 


RTAOOOOO 1 84 AF.g. 12.1 


MOOOOl 552B:D04 


97 


945 


RTAOOOOO 1 78AR.a.20. 1 


v sr r\ at\ y\ s\ t **% a*" ¥ ¥ 1 1 

MOO0Ol362C:Hl l 


98 


16269 


RTAOOOOO 1 78 AF.p. 1.1 


MOOOOl 389A:C08 


99 




RTAOOOOO 1 83 AF.c.24.1 


MOOOOl 504A:E0l 


100 


16731 


RTAOOOOO 1 81 AF.a.20.1 


M0000l442C:D07 


101 


12439 


RTAOOOOO 1 90AF.O.24. 1 


M00003975A:Gl l 


102 


3162 


RTAOOOOO 1 77AF.j. 1 2.3 


MOOOOl 35 1B:A08 


103 




RTAOOOOO 1 94AF.b. 1 9. 1 


M00004505D:F08 


104 




RTAOOOOO 193AF.n. 15.1 


M00004384C:D02 


105 




RTAOOOOO 186AF.n.7.1 


M00001651 A:H01 


106 


10717 


RTA00000181AF.d.l0.1 


MOOOOl 447A:G03 


107 


4573 


RTAOOOOO 189AF.j. 12.1 


M00003871C:E02 


108 




RTAOOOOO 186 AF.h. 14.1 


MOOOOl 632D:H07 


109 


11443 


RTAOOOOO 192AF.1. 13.2 


M00004185C:C03 


110 


5892 


RTA00000184AF.d.ll.l 


M00001548A:E10 


111 


3162 


RTAOOOOO 1 77AFJ.1 2. 1 


M00001351B:A08 


112 


10470 


RTAOOOOO 1 85 AF.k.6.1 


MOOOOl 597D:C05 


113 


17055 


RTAOOOOO 187AF.m.3.1 


M00001682C:B12 


114 


2030 


RTAOOOOO 1 93 AF.m.20.1 


M00004372A:A03 


115 


6558 


RTAOOOOO 1 84AF.m.21 . 1 


MOOOOl 560D:F10 


116 


23255 


RTAOOOOO 190AF.j.4.1 


M00003922A:E06 


117 


9577 


RTAOOOOO 1 79AF.0. 17.1 


MOOOOl 409C:Dl 2 


118 




RTAOOOOO 1 80AF.a. 11.1 


M0000 1 4 1 4C : AO/ 


119 


8 


RTAOOOOO 1 8 1 AF.e. 1 7. 1 


MOOOOl 448D:C09 


120 


67907 


RTAOOOOO 188 AF.g. 11.1 


M00003774C:A03 


121 


12081 


RTAOOOOO 133A.d. 14.2 


A 4 A AAA 1 ^ / A A ./^ 1 A 

MOOOO 1 469A:C 1 0 


122 


2448 


RTA00000119AJ.21.1 


A 1AAAA1 A y /\ A ¥~"/\^T 

MOOOOl 460A:F06 


123 


3389 


RTAOOOOO 189AF.g.3.1 


MOO0O3857A:G10 


124 


39174 


RTAOOOOO 124A.n. 13.1 


MOOOOl 54 1A:H03 


125 


24488 


RTAOOOOO 190AF.n. 16.1 


M00003968B:F06 


126 


8210 


RTAOOOOO 192AF.n. 13.1 


M00004197D:H01 
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SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


127 




RTA00000135A.1.2.2 


M00001545A:B02 


128 


40455 


RTA00O00 1 90AF.m . 1 0.2 


M00003958C:G10 


129 


9577 


RTA00000180AF.d.23.1 


M00001421C:F01 


130 


13183 


RTA00000192AF.a.24.1 


M00004114C:F11 


131 


5214 


RTA00000186AF.g.ll.l 


M00001630B:H09 


132 


67252 


RTA00000187AF.O.6.1 


M00001716D:H05 


133 


3108 


RTA00000188AF.d.24.1 


M00003763A:F06 


134 


2464 


RTA00000178AF.n.l8.1 


M00001387A:C05 


135 


36313 


RTA00000181AF.e.23.1 


M00001448D.H01 


136 


23255 


RTA00000177AF.e.l4.3 


M00001343D:H07 


137 


7985 


RTA00000182AR.J.2.1 


M00001481D:A05 


138 


8286 


RTA00000183AF.O.1.1 


M00001540A:D06 


139 


22195 


RTA00000180AF.g.7.1 


M00001425B:H08 


140 


4573 


RTA00000184AF.h.9.1 


M00001553B:F12 


141 


26875 


RTA00000187AF.U.1 


M00001679A:F10 


142 


7187 


RTA00000177AF.L8.2 


M00O0135OA:H01 


143 


86859 


RTA00000118A.p.8.1 


M00001452A:B12 


144 


4623 


RTA00000185AF.f.4.1 


M00001586C:C05 


145 




RTA00000121A.C.10.1 


M00001469A:A01 


146 


10185 


RTA0OOO0183AF.d.5.1 


M00001504C:A07 


147 




RTA00000183AF.p.4.1 


M00001542B:B01 


148 


15069 


RTA00000191AF.1.6.1 


M00004081C:D10 


149 


39304 


RTA00000118A.j.21.1 


M00001450A:A02 


150 


8672 


RTA00000190AF.f.ll.l 


M00003909D:C03 


151 


13576 


RTA00000177AF.g.l6.1 


M00001347A:B10 


152 


6293 


RTA0O0O0185AF.e.ll.l 


M00001583D:A10 


153 


16977 


RTA00000192AF.g.3.1 


M00004151D:B08 


154 


5345 


RTA00000189AF.1.19.1 


M00003879B:C11 


155 


4905 


RTA00000193AF.e.l4.1 


M00004269D:D06 


156 


17036 


RTA00000191AF.j.l0.1 


M00004072B:B05 


157 


5417 


RTA00000191AF.h.l9.1 


M00004059A:D06 


158 


7172 


RTA00000178AF.f.9.1 


M00001371C:E09 


159 


40044 


RTA00000186AF.d.l.l 


M00001621C:C08 


160 


4386 


RTA00000184AF.J.4.1 


MOOO01556B:C08 


161 


40044 


RTA00000183AF.g.22.1 


M00001514C:D11 


162 


9685 


RTA00000183AF.C.11.1 


M00001501D:C02 


163 


22155 


RTAOO0O0185AF.n.9.1 


M00001608B:E03 


164 


10515 


RTA00000189AF.f.l8.1 


M00003853A:F12 


165 


6539 


RTA00000185AF.d.ll.l 


M00001579D:C03 


166 


15066 


RTA00000180AF.e.24.1 


M00001423B:E07 


167 


4261 


RTA00000180AF.h.5.1 


M00001426D:C08 


168 


13864 


RTA00000125A.m.9.1 


M00001545A:D08 


169 


6539 


RTA00000189AF.d.22.1 


M00003844C:B1 1 
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WO 99/33982 
SEQ ID NO: Cluster ID 

170 11465 

171 3266 

172 102 

173 16970 

174 12971 

175 5007 

176 3765 

177 11294 

178 3681 

179 9283 

180 18699 

181 86110 

182 39648 

183 7337 

184 1334 

185 17076 

186 22794 

187 39171 

188 8551 

189 5857 

190 9443 

191 9457 

192 7206 

193 22979 

194 40455 

195 7221 
196 

197 7239 

198 31587 

199 16317 

200 13576 

201 5779 

202 6124 

203 9952 
204 

205 5779 

206 39490 

207 4416 

208 4009 

209 5336 

210 39186 

211 40122 

212 12532 



Sequence Name 

RTA00000185AF.m.l9.1 

RTA00000184AR.g.l.l 

RTA00000184AF.O.5.1 

RTA00000181AR.i.l8.2 

RTA00000193AF.a.20.1 

RTA00000177AF.g.2.1 

RTA00000135A.d.l.l 

RTA00000184AF.j.6.1 

RTA00000131A.g.l5.2 

RTA00000181AR.m.21.2 

RTA00000182AF.m.l6.1 

RTA00000181AF.f.l2.1 

RTA00000178AR.1.8.2 

RTA00000123A.b.l7.1 

RTA00000178AF.j.7.1 

RTA00000188AF.d.21.1 

RTA00000138A.b.5.1 

RTA00000186AF.1.7.1 

RTA00000179AF.p.21.1 

RTA00000118A.g.l4.1 

RTA00000183AF.C.1.1 

RTA00000193AF.U4.2 

RTA00000182AF.O.15.1 

RTA00000178AF.L22.1 

RTA00000190AR.m.l0.1 

RTA00000191AF.p.9.1 

RTA00000191AF.j.9.1 

RTA00000126A.m.4.2 

RTA00000189AF.1.20.1 

RTA00000190AF.e.6.1 

RTA00000189AR.O.13.1 

RTA00000177AF.g.l4.3 

RTA00000191AR.e.2.3 

RTA00000180AF.C.20.1 

RTA00000188AF.L8.1 

RTA00000177AF.g.l4.1 

RTA00000128A.b.4.1 

RTA00000187AF.h.l3.1 

RTA00000179AF.e.20.1 

RTA00000183AF.b.l3.1 

RTA00000121A.p.l5.1 

RTA00000190AF.n.23.1 

RTA00000190AF.g.2.1 
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PCT/US98/27610 
Clone Name 

M00001607A:E1 1 

M00001551C:G09 

M00001563B:F06 

M00001452C:B06 

M00004223D:E04 

M00001346A:F09 

M00001541A:D02 

M00001556B:G02 

M00001449A:D12 

M00001455D:F09 

M00001490B:C04 

M00001449C:D06 

M00001383A-.C03 

M00001528A:C04 

M00001379A:A05 

M00003762C:B08 

M00001601A:D08 

M00001644C:B07 

M00001412B:B10 

M00001449A:A12 

M00001500C:E04 

M00004307C:A06 

M00001494D:F06 

M00001382C:A02 

M00003958C:G10 

M00004105C:A04 

M00004072A:C03 

M00001550A:A03 

M00003879B:D10 

M00003907D:H04 

M00003885C:A02 

M00001346D:G06 

M00004028D:A06 

M00001418B:F03 

M00003784D:D12 

M00001346D:G06 

M00001557A:F03 

M00001678D:F12 

M00001396A:C03 

M00001500A:C05 

M00001512A:A09 

M00003970C:B09 

M00003912B:D01 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 




SEQ ID NO 


Cluster ID 


Sequence Name 


Clone Name 


213 


8078 


RTAO0OO0 1 77AR. 1. 13.1 


M00001353A:G12 


214 


3900 


RTAO0O00 1 90 AF.g. 13.1 


M00003914C:F05 


215 


7589 


RTAO0000 1 20 A .p.23 . 1 


M00001468A:F05 


216 


8298 


RTA00000 1 27A.d. 19.1 


M00001553A:H06 


217 


4443 


RTA00000 1 77 AF.b.20.4 


M00001341A:E12 


218 


26295 


RTA00000 1 93 AF.i.24.2 


M00004312A:G03 


219 


3389 


RTA00000 1 83 AF.m. 1 9. 1 


M00001537B:G07 


220 


7015 


RTA00000 1 87AF.f. 18.1 


M00001673C:H02 


221 


8526 


RTA 00000 1 80AF.d. 1 . 1 


M00001418D:B06 


222 


4665 


RTA00000 1 86AF.m.3 . 1 


M00001648C:A01 


223 


1399 


RTA00000 1 29A.0. 1 0. 1 


M00001604A:B10 


224 


9244 


RTA00000127A.1.3.1 


M00001556A:C09 


225 




RTA00000 1 79 A Fj. 1 3. 1 


M00001400B:H06 


226 


82498 


RTA000001 18A.m.l0.1 


M00001450A:B12 


227 


35702 


RTA00000 1 87AR.C. 1 5 .2 


M00001663A:E04 


228 


38759 


RTA00000120A.m.l2.3 


M00001467A:B07 


229 


39648 


RTA00000178AF.1.8.1 


M00001383A:C03 


230 


19105 


RTA00000133A.e.l5.1 


M00001469A:H12 


231 


85064 


RTA00000 1 3 1 A.m.23 . 1 


M00001452A:F05 


232 


9285 


RTA00000 1 9 1 AF.m. 1 8. 1 


M00004086D:G06 


233 


9285 


RTA00000 1 90AF.d.7. 1 


M00003906C:E10 


234 


39391 


RTA00000138A.C.3.1 


M00001604A:F05 


235 




RTA00000 1 78AF.d.20. 1 


M00001368D:E03 


236 


39498 


RTAOOOOO 119A.J. 20.1 


M00001460A:F12 


237 


7798 


RTAOOOOO 1 89AF.k. 1 2. 1 


M00003876D:E12 


238 


7798 


RTA00O00 1 89AF.C. 18.1 


MOOO03839A:D08 


239 


19829 


RTAOOOOO 1 25 A.h.24.4 


M00001544A:G02 


240 




RTAOOOOO 1 88 AF.d. 11.1 


M00003761D:A09 


241 


4275 


RTAOOOOO 1 20 A j .14.1 


M00001466A:E07 


242 


221 13 


RTAOOOOO 1 25 A.c.7.1 


M00001542A:A09 


243 


40314 


RTAOOOOO 1 86AF.C. 15.1 


M00001619C:F12 


^ A A 

244 


10944 


RTAOOOOO 1 26A.h. 1 7.2 


M00001549A:D08 


245 


39809 


RTAOOOOO 1 90AF.e.3 . 1 


M00003907D:A09 


24o 


n nor 

22085 


RTAOOOOO 1 35 A.e.5.2 


M00001541A.F07 


24 / 


19255 


RTAOOOOO 1 3 5 A.m . 1 8 . 1 


M00001545A:C03 


^ AO 

248 


1431 1 


RTAOOOOO 1 92AF.0.2. 1 


M00004203B:C12 


^ /irk 
249 


8479 


RTAOOOOO 1 89AF.j. 22.1 


M00003875C:G07 


250 




RTAOOOOO 1 89AF.j. 23.1 


M00003875D:D11 


251 


4193 


RTAOOOOO 1 84AF.e. 13.1 


M00001549B:F06 


252 


22814 


RTAOOOOO 1 84AF.h. 14.1 


M00001553D:D10 


253 


39563 


RTA 000001 79 AF.k.20.1 


M00001402A:E08 


254 


39420 


RTAOOOOO 1 34 A.o.23.1 


M00001537A:F12 


255 


11589 


RTAOOOOO 177AF.b. 17.4 


M00001340D:F10 
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BNSDOCID: <WO 9933982A2_I_> 




PCT/US98/27610 



SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


256 


4937 


RTA00000 1 9 1 AF.p.2 1 . 1 


■» AAAAA A 1 AO A .CAiC 

M00004 1 08A: fcuo 


257 


39412 


RTA00000 1 3 3 A.k. 17.1 


X jfAAAAl C 1 1 A . I I A/ N 

M0000151 lA:HUo 


258 


4837 


T" A AAA/"VA* O A Tk 1 O s"\ 

RTA00000185AR.k.3.2 


a 4AAAA1 cmr'.uA') 

M0000 1 597C:HU2 


259 


13046 


T*k T* A AAAAA1 A"> A 1_ 1 A 1 

RTA0O000 1 93 AF.h. 19.1 


M000U429oChiU / 


260 


4141 


r> t* A AAAAA 1 11 A T" AA "> 

RTA000001 77AF.p.20.3 


MUUUU 1 3o 1 A. AUj 


261 


38085 


T» T" A AAAAA1 A _ 1 C 1 

RTA00000 1 23 A.e. 15.1 


MUUUU 1 D 3 1 A.DU 1 


262 




Ti T* A AAAAA 1 OA AT Ol 

RTAOO0OO 1 89AF.p.8. 1 




263 


1 1451 


T> T* A AAAAA 1 A'* AT- 1*7 1 

RTA00000 1 92AF.p. 17.1 


X if AAAA/fA 1 yi/^.LJAC 

MUUUU42 HL.HUj 


264 


14507 


T"» T" A AAAAA1 OA A 1~) 1 11 A; 

RTA00000189AR.1.23.2 


MUUUUio /yLl.AUz 


265 


40054 


T"% T" AAAAAAtOAAT"" 1A 1 

RTA00000 1 80AF.p. 10.1 


XyfAAAAl /1 1 Ai^ . TTAO 


266 


39423 


T"» A AAAAA 1 *> A A 1 1 

RTAOOOOO 134A.k.22.1 


K A AAA A 1 CTC A .TTTA 

MOOuO 1 535A:r 1 U 


267 


39453 


RTA0O0OOl35A.g.l l.l 


A if A A A A 1 f /II A „ "T7 A /T 

M0000 1 542A:b0o 


268 


10751 


RTA0O000 1 87AF.K.7. 1 


X AAAAA 1 znAT"\. AA*) 

M0000 1 679D:D03 


269 


10751 


RTAOOOOO 1 87 AF.k.6.1 


X AAAAA 1 /"7Ar\ .r\A1 

M0O00 1 679D:D03 


270 


78091 


RTAOOOOO 187AF.J.6.1 


X AAAAA 1 Z' 1C\f** .TA1 

M0000l679C:F0l 


271 


39539 


RTA0O000 1 27 A. 1 .21.1 


X AAAAA 1 C C C A . DA'I 

M0000 1 555A:B02 


272 




RTA00000 1 82 AF.L 1 5 . 1 


X AAAAA 1 A O TT* T IAZT 

M00001487B:H06 


273 




RTA00000 1 94AF.d. 13.1 


XAAAAAAOAZT A . A*7 

M00004896A:C07 


274 




RTA00000128A.C.20.1 


X AAAAA1 CCO A . T T A C 

M00001 558A:H05 


275 


9283 


RTA00000181AR.m.22.2 


X AAAAA 1 /( ff A.PAA 

M0000 1 455D:F09 


276 


39168 


RTA00000 1 2 1 A. ! . 1 0 . 1 


X AAAAA 1 C A^7 A T TAC 

M0O00 1 507A:H05 


277 


39458 


RTAOOOOO 1 26 A. p. 1 5 .2 


X AAAAA 1 CC~\ A At 1 

M00001 552A:D1 1 


278 


14391 


RTAOO000 1 77AF.m. 1 7.3 


X AAAAA IOC CT> . /"^ 1 A 

M00001355B:Glu 


279 


39195 


RTAOOOOO 1 3 7A.c. 16.1 


X AAAAA 1 C C C A ./""•A 1 

M00001 555A:C0l 


280 


7212 


RTAOOOOO 1 93 AF.b. 14.1 


X AAAAA A^"> ATI ./^» AT 

M00004230B:C07 


281 


4015 


RTA00000136A.e.l2.1 


X AAAAA 1 C yl A A . DAI 

M00001 549A:B02 


282 


12977 


RTAOOOOO 1 89 AF.j .19.1 


X A A A AA*> OTCD. T? A /I 

M00003875B:r04 


283 




RTAOOOOO 1 78AF.m. 13 . 1 


» AAAAA 1 OOA T~> . A 1 1 

M00001384B:A1 1 


284 


14391 


r\ T* A AAA 1 A 1 A 1 **T 1 

RTAOOOOO 1 9 1 AF.l. 7. 1 


X AAAAA A AO 1 /"^ . r\ 1 O 

M0000408 1 C: L) 1 2 


285 




1~» T" A AAAAA1A>< A T~ ' — . **» *> 1 

RTAOOOOO 1 94AF.C.23 . 1 


TV A A A AA/f A 1 T"\ • AAC 


286 




RTAOOOOO 1 8 1 AF.b.7. 1 


XAAAAA1 /1/C3D.EA1 

M 0 00U 1 443 B : r 0 1 


287 


8358 


T» A AAAAA t O A T" * ^ 1 

RTAOOOOO 1 83 AF.l. 5. 1 


TVAAAAAT OOD.UA/l 

M0000 1 528B:H04 


288 


1267 


T"k T"* A AA AAA 1 A A _ C 1 

RTA0O000 1 25 A.o.5 . 1 


X AAAAA 1 A .PI 1 

MOUUO 1 546A:0 1 1 


289 




RTA00000189AF.I.7.1 


X A A A AA1 OC1 D. HAO 


290 


16347 


T* T* 1 A AAAAA 1 O A A 1 C 1 

RTAOOOOO 1 84 AF.e. 15.1 


XAAAAA1 C/IOr.CA^ 


291 


7899 


TV T" A AAAAA1AO A T* _ 1 ,- 7 1 

RTAOOOOO 1 93 AF.a. 1 7. 1 


A A A A A A A T> "7 X> . TT\AO 

MUUUU4223B:L)Uy 


292 


2379 


T>Tn A AAAAA 1 TO A T « ^ 1 

RTAOOOOO 1 78 AF.a.6.1 


N AAAAA T H-CAO 

JvlUUUU 1 3oI JJ.rUo 


293 


39478 


Tk r T^ A AAAAA1 *> *> A 'CI 

RTAOOOOO 1 33 A.i.5. 1 


XAAAAAI/ITI A -DAI 

MUUUU 1 47 1 A:BU1 


294 


39392 


T» TP A AAAAA 1 *1 /I A 1 / 1 

RTAOOOOO 1 34 A.m . 1 6. 1 


XAAAAA1 Q^JtC A .PAQ 

MUUUU 1 536A:CU8 


295 


5053 


RTAOOOOO 1 84AF.0. 12.1 


M00001564A:B12 


296 


16999 


RTAOOOOO 1 85 AF.k.9.1 


M00001598A:G03 


297 


39180 


RTAOOOOO 126A.n.8. 2 


M00001551A:F05 


298 


1037 


RTA00000121A.f.8.1 


M00001470A:B10 
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SEQIDNO: Cluster ID Sequence Name 



299 


6867 


RTA00000178AF.e.l2.1 


300 


10539 


RTA00000183AF.a.24.1 


301 


41633 


RTA00000118A.g.l6.1 


302 


23218 


RTA00000187AR.C.5.2 


303 


39380 


RTA00000129A.e.24.1 


304 




RTA00000185AF.d.24.1 


305 




RTA00000177AF.O.4.3 


306 


6974 


RTA00000184AF.a.l5.1 


307 




RTA00000185AF.g.ll.l 


308 


15855 


RTA00000184AF.j.l.l 


309 


84328 


RTA00000118A.p.l0.1 


310 


10145 


RTA00000120A.g.l2.1 


311 


39805 


RTA00000177AFx.21.3 


312 




RTA00000 1 87AF.h.23 . 1 


313 


6298 


RTA000001 87AR.i. 1 0.2 


314 


14367 


RTA00000187AF.e.8.1 


315 




RTA00000193AF.C.22.1 


316 


16921 


RTA00000183AF.k.6.1 


317 


1577 


RTA00000184AF.i.23.1 


318 


8773 


RTA00000187AF.f.24.1 


319 




RTA00000194AF.a.ll.l 


320 


39886 


RTA00000178AFJ.24.1 


321 


13532 


RTA00000181AF.C.4.1 


322 




RTA00000193AF.d.2.1 


323 


5257 


RTA00000192AFX3.1 


324 


9061 


RTA00000191AR.e.ll.2 


325 


19267 


RTA00000186AF.L12.1 


326 


20212 


RTA00000134A.L22.1 


327 


16653 


RTA00000181AF.k.5.3 


328 


16985 


RTA00000177AF.h.l0.1 


329 


12977 


RTA00000189ARJ.19.1 


330 


9061 


RTA00000191AR.e.ll.3 


331 




RTA00000194AR.a.l0.2 


332 


6468 


RTA00000187AF.d.l5.1 


333 


16392 


RTA00000192AF.1.1.1 


334 


14627 


RTA00000187AF.g.23.1 


335 


6583 


RTA00000179AF.d.l3.1 


336 


6806 


RTA0OO00177AF.g.l3.3 


337 


9635 


RTA00000137A.e.23.4 


338 


689 


RTA00000181AR.L22.1 


339 


4119 


RTA00000183AF.k.l6.1 


340 


8952 


RTA00000183AF.h.l5.1 


341 


2379 


RTA00000192AF.p.8.1 



148 




Clone Name 

M00001370A:C09 

M00001499B:A1 1 

M00001449A.-B12 

M00001662C:A09 

M00001587A:B11 

M00001582D:F05 

M00001358C:C06 

M00001544B.B07 

M00001590B:F03 

M00001556A:H01 

M00001452A:B04 

M00001465A:B11 

M00001342B:E06 

M00001679A.F06 

M00001679B:F01 

M00001670C:H02 

M00004249D:G12 

M00001534A:C04 

M00001556A:F11 

M00001675A:C09 

M00004461A:B09 

M00001380D.-B09 

M00001445A:F05 

M00004251C:G07 

M00004146C:C11 

M00004031A:A12 

M00001645A:C12 

M00001535A:C06 

M00001453C:F06 

M00001348B:G06 

M00003875B:F04 

M00004031A:A12 

M00004461A:B08 

M00001669B:F02 

M00004183C:D07 

M00001677C:E10 

M00001394A:F01 

M00001346D:E03 

M00001557A:F01 

M00001454D:G03 

M00001534C:A01 

M00001518C:B11 

M00004212B:C07 



BNSDOCID: <WO 9933982A2J_> 




WO 99/33982 
SEQ ID NO: Cluster ID Sequence Name 



342 


39486 


RTA00000128A.m.22.2 


343 


21877 


RTA00000 1 89AF.b.2 1 . 1 


344 


6874 


RTA00000 1 92 AF.a. 14.1 


345 


6874 


RTA00000189AF.e.9.1 


346 


37285 


RTA00000191AF.f.l 1.1 


347 




RTA000OO 1 93 AF.j .20. 1 


348 


7674 


RTA000001 18A.g.9.1 


349 


2797 


RTAOOOOO 180AF.i. 19.1 


350 




RTA00000 1 84 AF.g.22 . 1 


351 


7802 


RTA00000185AF.n.5.1 


352 


16921 


RTA00000193AF.h.l5.1 


353 


11494 


RTA00000192AF.j.6.1 


354 


17062 


RTA00000177AF.b.8.4 


355 


16245 


RTA00000177AF.k.9.3 


356 


83103 


RTA00000119A.e.24.2 


357 


4309 


RTA00000186AF.e.22.1 


358 


13072 


RTA00000 1 8 1 AR.m.5.2 


359 


4059 


RTA00000 1 77AF.n. 1 8.3 


360 


5178 


RTA00000 1 78 AF.n. 10.1 


361 


1120 


RTA00000118A.p.l5.3 


362 


6420 


RTA00000183AF.d.ll.l 


363 


13913 


RTA00000186AF.e.6.1 


364 




RTA00000192AF.C.2.1 


365 


3956 


RTAO000O183AF.g.3.1 


366 


14364 


RTA00000183AF.g.l2.1 


367 


6880 


RTAOOOOO 1 9 1 AF.m.20. 1 


368 


84182 


RTAO0000180AF.h.l9.1 


369 


2790 


RTAOOOOO 1 77 AF.e.2.1 


370 


4561 


RTA00000184AF.i.21.1 


371 


8847 


RTAOOOOO 1 80AF.b. 1 6. 1 


372 


56020 


RTAOOOOO 1 93 AF.g.2.1 


373 


1531 


RTA00000119A.O.3.1 


374 


6420 


RTAOOOOO 1 77AF.f. 10.3 


375 




RTAOOOOO 1 88 AF.b. 12.1 


376 




RTAOOOOO 1 80AF.k.24. 1 


377 




RTAOOOOO 1 84 AF.a.8.1 


378 


2696 


RTA00000134A.m. 13.1 


379 


260 


RTA000001 85AR.U2.2 


380 


11350 


RTAOOOOO 189AF.a.24.2 


381 


2428 


RTAOOOOO 1 23 A.1.2 1.1 


382 


4313 


RTAOOOOO 122A.n.3.1 


383 




RTAOOOOO 184AF.p.3.1 


384 


697 


RTAOOOOO 1 88 AF.d.6.1 



149 



^ PCT/US98/27610 
Clone Name 

M0O001561A:C05 

M0O0O3833A:E05 

M00004111D:A08 

M00003846B:D06 

M0O004035C:A07 

M00004327B:H04 

MOO001416A:H01 

M00001429A:H04 

MOO0O1552D:A01 

M0O0O1608A:B03 

M00004295D:F12 

M00004172C:D08 

M00001340B:A06 

MOO001352A:EO2 

M00001454A:A09 

M00001624C:F01 

M00001455B:E12 

M00001357D:D11 

M00001386C:B12 

MOO001452A:D08 

M00001504D:G06 

M00001623D:F10 

M00004121B:G01 

M00001512D:G09 

M00001513C:E08 

M00004087D:A01 

M00001428A:H10 

M00001343C:F10 

M00001555D:G10 

M00001416B:HU 

M00004285B:E08 

M00001461A:D06 

M00001345A:E01 

M00003754C:E09 

M00001432C:F06 

M00001544A:E06 

M00001536A:B07 

M00001594B:H04 

M00003826B:A06 

M00001533A:C11 

M00001517A:B07 

M00001566B:D11 

M00003759B:B09 



BNSDOCID: <WO 99339B2A2J_> 




SEQIDNO: Cluster ID 



385 


5619 


386 


4568 


387 




388 


5622 


389 


7514 


390 


5619 


391 


7570 


392 


23358 


393 


23210 


394 


5192 


395 


13538 


396 




397 




398 




399 


4718 


400 


14929 


401 


4908 


402 




403 




404 


17732 


405 


17062 


406 


11589 


407 


4443 


408 


39805 


409 


2790 


410 


23255 


411 


6420 


412 


5007 


413 


13576 


414 


16927 


415 


16985 


416 


3584 


417 




418 


7187 


419 


16245 


420 


8078 


421 


14929 


422 


14391 


423 


4141 


424 


2379 


425 


5622 


426 


945 


427 


40132 




Sequence Name 


Clone Name 


RTA00000188AF.1.9.1 


M00003796C:D05 


RTA00000122A.d.l5.3 


M00001513A:B06 


RTA00000177AF.i.6.2 


M00001350A:B08 


RTA00000178AF.a.ll.l 


M00001362B:D10 


RTA00000184AF.k.21.1 


M00001558B:H11 


RTA00000189AF.f.l7.1 


M00003853A:D04 


RTA00000187AF.g.24.1 


M00001677D:A07 


RTA00000190AF.O.21.1 


M00003974D:H02 


RTA00000190AF.O.20.1 


M00003974D:E07 


RTA00000184AF.k.2.1 


M00001557B:H10 


RTA00000180AF.a.24.1 


M00001415A:H06 


RTA00000189AF.h.I7.1 


M00003867A:D10 


RTA00000192AF.O.11.1 


M00004205D:F06 


RTA00000184AF.1.11.1 


M00001559B:F01 


RTA00000189AF.g.5.1 


M00003857A:H03 


RTA00000177AF.m.l.2 


M00O01353D:D10 


RTA00000192AF.j.2.1 


M00004171D:B03 


RTA00000178AF.k.l6.1 


M00001381D:E06 


RTA00000 1 94 AF.c.24. 1 


M00004692A:H08 


RTA00000178AR.L2.2 


M00001376B:G06 


80.Al.sp6:130208.Seq 


M00001340B:A06 


80.Bl.sp6:130220.Seq 


M00001340D:F10 


8O.Cl.sp6:130232.Seq 


M00001341A:E12 


80.Dl.sp6:130244.Seq 


M00001342B:E06 


80.El.sp6:130256.Seq 


M00001343C:F10 


80.Fl.sp6:130268.Seq 


M00001343D:H07 


8O.Gl.sp6:130280.Seq 


M00001345A:E01 


8O.Hl.sp6:130292.Seq 


M00001346A:F09 


80.D2.sp6:130245.Seq 


M00001347A:B10 


80.E2.sp6:130257.Seq 


M00001348B:B04 


80.F2.sp6:130269.Seq 


M00001348B:G06 


80.G2.sp6: 13028 l.Seq 


M00001349B:B08 


80.H2.sp6:130293.Seq 


M00001350A:B08 


80.A3.sp6:130210.Seq 


M00001350A:H01 


80.D3.sp6:130246.Seq 


M00001352A:E02 


80.E3.sp6:130258.Seq 


M00O01353A:G12 


80.F3.sp6:130270.Seq 


M00001353D:D10 


80.G3.sp6:130282.Seq 


M00001355B:G10 


80.B4.sp6:130223.Seq 


M00001361A:A05 


80.C4.sp6:130235.Seq 


M00001361D:F08 


80.D4.sp6:130247.Seq 


M00001362B:D10 


8O.E4.sp6:130259.Seq 


M00001362C:H11 


80.F4.sp6: 13027 l.Seq 


M00001365C.C10 


150 





BNSDOCID: <WO 9933982A2J_> 




PCT/US98/27610 



SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


428 




80.G4.sp6: 1 30283. Seq 


JvlUUUU t jooUxUJ 


429 


6867 


80.H4.sp6: 130295. Seq 


iviuuuu i j /u/\.L,uy 


430 


7172 


o r\ a c ~ f i o no 1 *\ o 

80. A5.sp6: 1 302 1 2. Seq 


jviuuuu i j / ictuy 


431 


17732 


80.B:>.sp6: 1 30224. Seq 


IVIUUUU I J /OD.UUO 


432 


39833 


80.C5.sp6: 1 30236. Seq 


IVIUUUU 1 J /otJ.JoUZ 


433 


1334 


80.D5.sp6: 1 30248. Seq 


iviuuuu l j /y/\./\u-) 


434 


39886 


80. E5.sp6:l 30260. Seq 


JVIUUUU 1 JOUU.DU7 


435 




80.F5.sp6: 1 30272. Seq 


IVIUUUU 1 Jo 1 UXvO 


436 


22979 


80.G5.sp6: 130284. Seq 


IVIUUUU 1 joZL.AUZ 


437 


39648 


on it^ — i no n o ~ 

80.H5.sp6: 1 30296. Seq 


IVIUUUU 1 jojA.LUj 


438 




80.B6.sp6: 1 30225. Seq 


MUUUU 1 3o4r>. A I 1 


439 


5178 


80.C6.sp6: 1 30237. Seq 


\Arif\f\f\ 1 ^ cap - n 1 0 
MUUUU i joot.o 1 Z 


440 


2464 


80.D6.sp6: 1 30249.Seq 


MUUUU 1 Jo /A.L.U3 


441 


7587 


80.E6.sp6: 1 3026 1 .Seq 


IV A f\f\f\f\ 1 1 0~7D . CLiYl 

MUUUO 1 3 0 / d . UU 3 


442 


5832 


80.F6.sp6: 1 30273 .Seq 


\ A\r\f\f\c\'\ < 2ocn.r , n< 


443 


16269 


80.G6.sp6:l 30285. Seq 


» A(\f\f\f\ 1 oon a .PrtQ 

M000U 1 3o9A:Cuo 


444 


6583 


80.H6.sp6: 1 30297. Seq 


MUUUU 1 JV4A.rU I 


445 


4009 


80.A7.sp6: 1 302 1 4. Seq 


\ifAAAA1 A .rAQ 

M00OO 1 39oA:C03 


446 




80.B7.sp6: 1 30226. Seq 


KvfAAAAl /lAHD'UA^ 

MUUUU 1 4UUfc>.HUo 


447 


39563 


80.C7.sp6:130238.Seq 


\yfAAAA1 /tA") A .HAC 

MUUUU 1 4UZ A.tvUo 


448 


5556 


80.D7.sp6: 1 30250.Seq 


AvfAAAAl A A*7H> • T\ 1 1 

MUUUU 14U/r>.LM 1 


449 


9577 


80.E7.sp6: 1 30262. Seq 


* m AAAA 1 A AOP«n 1 O 

MUUUU 1 4UyC D 1 2 


450 


7005 


80.F7.sp6: 1 30274. Seq 


K A AA AA 1 /I 1 A A • AA7 

MUUUU 1 4 1 UA.LHJ / 


451 


8551 


80.G7.sp6: 1 30286. Seq 


\/fAAAA1 /I 1 O D • D 1 A 

MUUUU i 4 lzt>.r> 1 U 


452 




80.H7.sp6: 1 30298. Seq 


\ilAAAA1 /I 1 A A .DAI 

MUUUU 1 4 14A.DU J 


453 




80.A8.sp6: 1 302 1 5. Seq 


A yf AAAA 1 A 1 vH~V A AT 

MUUUU 1 4 1 4L. AU / 


454 


13538 


80.B8.sp6: 1 30227.Seq 


A >f AAAA 1 A 1 C A -TJAA 

MUUUU 1 4 1 A.HUo 


455 


8847 


80.C8.sp6: 1 30239.Seq 


A VlAAAA I/IKD.UI 1 

MUUUU 1 4 I oo.rl 1 1 


456 


36393 


80.D8.sp6: 1 3025 1 .Seq 


A/f AAAA 1 /I 1 "7 A .CA1 

MUUUU14 1 / A.t/UZ 


457 


9952 


80.E8.sp6: 1 30263 .Seq 


K/f AAAA 1 >d 1 CR'Fni 
MUUUU 14 I 5Jt3.rU j 


458 


9577 


80.G8.sp6: 130287. Seq 


A A A AAA 1 /!"> 1 1^*17 A 1 

MUUUU 1 4Z 1 L.rul 


459 


15066 


on t to ^.nrtOAn c* ^ _ 

80.H8.sp6: 1 30299. Seq 


IVvIAAAA 1 A'JIVl'Ufi'l 
MUUUU 1 hZjD.cU / 


460 


10470 


on a n ^ t *i n^ i o 

80. A9.sp6: 1 3 02 1 6.Seq 


AifAAAAl /IT/ID - f~lAQ 

MUUUU 1 4Z4D.OUV 


461 


22195 


on r* n z - t'^ni'^o o 

80.B9.sp6: 1 30228. Seq 


\A AAAA 1 /lO^Ti-l-inft 

MUUUU 1 4ZZ>Jt3.rlUo 


462 




80.C9.sp6: 1 30240.Seq 


A A AAAA 1 'FM 1 
MUUUU J 4ZOt>.Ly I Z 


463 


4261 


on r\A s i i n^ n c* 

80.D9.sp6: 1 30252. Seq 


A/iAAAAi a o/^r^ ♦r , n© 

MUUUU 14/OU.L/Uo 


464 


84182 


on ■""'n z i o n ^ z a o 

80.E9.sp6: 1 30264. Seq 


MUUUU 1 4ZoA.rl 1 U 


465 


40392 


on fin 1 1 AiArt ~ 

80.H9.sp6: 130300. Seq 


v/iaaaai zioon-nA? 

MUUUU 1 4ZVU.UU / 


466 


16731 


o n /""i in i o A^ i o ~ 

80.C 1 0.sp6: 1 3024 1 .Seq 


A4AAAA1 A /lOP-nni 
MUUUU 1 44ZL/.L/U / 


467 




on r\ 1 n „ .to no c o o ^ 

80. DIO. spoil 30253. Seq 


MUUUU I 443t5.rU 1 


468 


13532 


80.E10.sp6:130265.Seq 


M00001445A:F05 


469 


8 


80.H10.sp6:13030l.Seq 


M00001448D:C09 


470 


36313 


80.All.sp6:130218.Seq 


M00001448D:H01 



151 



BNSDOCID: <WO 9933982A2J_: 




SEQ ID NO 


: Cluster ID 


Sequence Name 


471 


5857 


8O.Bll.sp6:130230.Seq 


472 


41633 


80.Cll.sp6:130242.Seq 


473 


36535 


80.Dll.sp6:130254.Seq 


474 


86110 


80.Ell.sp6:130266.Seq 


475 


32663 


8O.Fll.sp6:130278.Seq 


476 


27250 


80.Gll.sp6:130290.Seq 


477 


16970 


80.Hll.sp6:130302.Seq 


478 


16130 


8O.A12.sp6:130219.Seq 


479 


16653 


80.B12.sp6:130231.Seq 


480 


7005 


80.C12.sp6:130243.Seq 


481 


13072 


80.F12.sp6:130279.Seq 


482 


9283 


80.G12.sp6:130291.Seq 


483 


23255 


100.Cl.sp6:131446.Seq 


484 


13576 


100.El.sp6:131470.Seq 


485 


7187 


100.C2.sp6:131447.Seq 


486 


14391 


100.E3.sp6:131472.Seq 


487 


945 


100.E4.sp6:131473.Seq 


488 


7172 


100.A5.sp6:131426.Seq 


489 


39648 


100.A6.sp6:131427.Seq 


490 


84182 


100.G9.sp6:131502.Seq 


491 


8 


100.Bll.sp6:131444.Seq 


492 


36535 


100.Dll.sp6:131468.Seq 


493 


82498 


100.Fll.sp6:131492.Seq 


494 


16970 


100.C12.sp6: 13 1457.Seq 


495 


16130 


100.D12.sp6:131469.Seq 


496 


7005 


121.Dl.sp6:131917.Seq 


497 




121.G6.sp6:131958.Seq 


498 


18957 


121.F7.sp6:131947.Seq 


499 


40044 


122.El.sp6:132121.Seq 


500 


5214 


122.C2.sp6:132098.Seq 


501 


6660 


122.B5.sp6:132089.Seq 


502 


13183 


123.D5.sp6:132305.Seq 


503 


6455 


123.E7.sp6:132319.Seq 


504 


5319 


123.F7.sp6: 13233 l.Seq 


C f\ c 

505 


11443 


123.A8.sp6:132272.Seq 


r r\ s~ 

506 




123.C8.sp6:132296.Seq 


507 


8210 


123.E8.sp6:132320.Seq 


508 


9457 


123.Dll.sp6:13231 l.Seq 


509 


6420 


172.El.sp6:133925.Seq 


510 


16245 


172.D2.sp6:133914.Seq 


511 


8078 


172.C3.sp6:133903.Seq 


512 


14929 


172.D3.sp6:133915.Seq 


513 


14391 


172.H3.sp6:133963.Seq 



152 




PCTAJS98/27610 
Clone Name 

M00001449A:A12 

M00001449A:B12 

M00001449A:G10 

M00001449C:D06 

M00001450A:A11 

M00001450A:D08 

M00001452C:B06 

M00001453A:E11 

M00OO1453C:F06 

M00001454B:C12 

M00001455B:E12 

M00001455D:F09 

MOO0O1343D:H07 

M00001347A:B10 

M00001350A:H01 

M00001355B:G10 

M00001362C:H11 

MO0001371C:E09 

M00001383A.-C03 

M00001428A:H10 

M00001448D:C09 

M00001449A.G10 

M00001450A:B12 

M00001452C:B06 

M00001453A:E11 

M00001454B:C12 

M00001506D:A09 

M00001528A:F09 

M00001621C:C08 

M00001630B:H09 

M00001679A:A06 

M00004U4C:F11 

M00004157C:A09 

M00004169C:C12 

M00004185C:C03 

M00004191D:B11 

M00004197D:H01 

M00004307C:A06 

M00001345A:E01 

M0O001352A:E02 

M00001353A:G12 

M00001353D:D10 

MOOOOI355B:G10 



BNSDOCID: <WO 9933982A2_L> 




PCT/US98/276I0 



SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


514 


6583 


1 72.B8.sp6: 1 33896.Seq 


M0000 1 394 A:F0 1 


515 


4009 


1 72.D8.sp6: 1 33920. Seq 


X (AAA/1 1 1A/ A - /"^ A ^ 

M0000l396A:C0j 


516 




1 72.B9.sp6: 1 33897. Seq 


M00001400B:H06 


517 




176.A3.sp6:134514.Seq 


MOOO0 1 632D:H07 


518 


19267 


176.G3.sp6:134586.Seq 


IV vf A AAA \ ii A C A ./""'I'l 

M0000 1 645 A:C 1 2 


519 


78091 


176.G5.sp6:134588.Seq 


iv /f a a a a i /: Trk/^ 1 .t?a i 

M0000 1 679C:r 0 1 


520 


17055 


176.D6.sp6:134553.Seq 


KXAAAA1 /O^/^ T"i i -> 

M00001682C:B12 


521 


6539 


1 76.D9.sp6: 1 34556. Seq 


\ ylAAAA^ O A A . T~> 1 1 

M00003544C:B1 1 


522 




1 77.H4.sp6: 1 34791 .Seq 


x 4 a a a a a t i r> ./-^ a i 

M00004121B:G01 


523 


5257 


1 77.F5.sp6: 1 34768. Seq 


a a" f\r\f\r\ a i a zr/^ 1 ./^* 1 1 

M00004146C:C1 1 


524 


1 1494 


1 77.E6.sp6: 1 34757.Seq 


M00004 1 72C:D08 


525 




1 77.G7.sp6: 1 34782. Seq 


h A f\l\f\i\ A~S f\C r^V.TAZT 

M00004205D:F0o 


526 


11451 


1 77.D8.sp6: 1 34747.Seq 


M00004214C:H05 


527 


9283 


1 73.D2.SP6: 134106.Seq 


\ / AAAA1 A C C r~"V T~" AO 

M0000 1 455D:F09 


528 


16283 


173.F3.SP6:134131.Seq 


M0000 1 467 A: D08 


529 


10539 


173.B5.SP6: 134085. Seq 


M00001499B:A1 1 


530 


6420 


173.F5.SP6:134133.Seq 


M0000 1 504D:G06 


531 


3956 


173.H5.SP6:134157.Seq 


M00001512D:G09 


532 




1 73.G7.SP6: 1 34 1 47.Seq 


M00001 544A:E06 


533 


1577 


173.C9.SP6:134101.Seq 


* / AAAA 1 C C /" A T~* 1 1 

M00001556A:F1 1 


534 


9635 


1 73.D9.SP6: 1 341 1 3. Seq 


M00001557A:F01 


535 


5192 


173.E9.SP6:134125.Seq 


» 4AAAA1 C C 1 T* .111 A 

M0000 1 557B:H 1 0 


536 


6539 


1 73 .A 1 2.SP6: 1 34080.Seq 


X /AAAA 1 f <7Ar\ A "y 

M00001579D:C03 


537 


945 


1 80.C2.sp6: 1 35940.Seq 


M00001362C:H1 1 


538 


7005 


180.H5.sp6:136003.Seq 


X A A A A A 1 A 1 A A PvA "7 

M0000 1 4 1 0 A:D07 


539 


39304 


1 80.G9.sp6: 1 3 5995 .Seq 


X A A A A A 1 A C A A . A Al 

M0000 1 450A: A02 


540 


27250 


1 80.B 1 0.sp6: 1 35936. Seq 


X A A A A A 1 A C A A . r\AO 

M0000 1 450A:DU8 


541 


35555 


184.A5.sp6:135530.Seq 


* A A AAA 1 O A . /""• A A 

M0000 1 528 A:C04 


542 


19255 


184.B10.sp6:135547.Seq 


X 4 A A A A "I C A C A . PA? 

M00001 545A:C03 


543 


6268 


1 84. C12.sp6: 135561. Seq 


X A A A A A 1 rf 1 A . 1~> 1 A 

M00001551 A:B10 


544 


3277 


2 1 7.E 1 .sp6 : 1 3 9406.Seq 


X A AAA A 1 £ ^ A A . A C 

M0000 1 624 A:BUo 


545 


39171 


2 1 7. A 1 2.sp6: 1 39369.Seq 


X A A AAA 1 H A A t~~* .T> A*7 

M0000 1 644C:BU7 


546 


11460 


2 1 9.F2.sp6: 1 39035. Seq 


X A A AAA 1 ^I^D.rAC 

M00001 676B:rUj 


547 


10539 


2 1 9.F6.sp6: 1 39039.Seq 


X A AAA A 1 OAA. A O 


548 


11476 


2 1 9.H8.sp6: 1 39065 .Seq 


X 4AAAA? T A HA.PAC 

M00003747D:CO5 


549 


4016 


79. Al .sp6: 1 300 1 6.Seq 


X A A AAA 1 1AC A . AAI 

M 0 000 1 3 9!) A : CU3 


550 


7674 


79.C 1 .sp6: 1 30040.Seq 


X >f AAAA1 /I 1 A .UA1 

M00001416A:HU1 


551 


3681 


79.El.sp6: 130064. Seq 


X A A AAA 1 /I ACa A . 1 O 


552 


39304 


79.F 1 .sp6: 1 30076. Seq 


X A A A A A 1 A C A A . A A^ 

M00001450A:AU2 


553 


82498 


79.G 1 .sp6: 1 30088.Seq 


XAAAAA1 A C A A . D 1 O 

M00001450A:tJ12 


554 


84328 


79.A2.sp6:130017.Seq 


M00001452A:B04 


555 


86859 


79.B2.sp6:130029.Seq 


M00001452A:B12 


556 


1120 


79.C2.sp6: 130041. Seq 


M00001452A:D08 



153 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 



PCT/US98/27610 



SEQIDNO: Cluster ID 


Sequence Name 


Clone Name 


c r "7 

557 


85064 


79.D2.sp6:l30053.Seq 


M0O0O1452A:FO5 


c c o 

558 


83103 


79.G2.sp6:l30089.Seq 


M00001454A:A09 


559 


10145 


79.F3.sp6:l30078.Seq 


M0000I465A:B11 


c n c\ 

560 


16283 


79.H3.sp6:l30l02.Seq 


M00001467A:D08 


c r 1 

56 1 


4568 


79.D4.sp6:l30055.Seq 


MOO001513A:B06 


C f o 

562 


A 1 1 O 

43 13 


79.F4.sp6:l30079.Seq 


M00001517A:B07 


563 


2428 


79.A5.sp6:l30020.Seq 


M00001533A:C11 


564 


39423 


79.C5.sp6:l30044.Seq 


M00001535A:F10 


ere 

565 


39174 


79.E5.sp6:l30068.Seq 


M00001541A:H03 


566 


221 13 


79.F5.sp6:l30080.Seq 


M00001542A:A09 


567 


19829 


79.H5.sp6:l30l04.Seq 


M00001544A:G02 


568 


13864 


79.B6.sp6:l30033.Seq 


M00001545A:D08 


569 


1058 


79.F6.sp6:l3008l.Seq 


M00001548A:H09 


570 


4015 


79.G6.sp6:l30093.Seq 


M00001549A:B02 


571 


39180 


79.A7.sp6:l30022.Seq 


M00001551A:F05 


572 


307 


79.C7.sp6:l30046.Seq 


M00001552A:B12 


573 


39458 


79.D7.sp6: 130058. Seq 


M00001552A:D11 


574 


39490 


79.G7.sp6:130094.Seq 


M00001557A:F03 


575 


39486 


79.B8.sp6:130035.Seq 


M00001561A:C05 


576 


39380 


79.E8.sp6:130071.Seq 


M00001587A:B11 


577 


1399 


79.G8.sp6:130095.Seq 


M00001604A:B10 


578 


39391 


79.A9.sp6:130024.Seq 


M00001604A:F05 


579 


6268 


79.G9.sp6:130096.Seq 


M00001551A:B10 


580 




377.F4.sp6:141957.Seq 


M00004692A:H08 


581 


2448 


89.Al.sp6:130667.Seq 


M00001460A:F06 


582 


1531 


89.Cl.sp6:130691.Seq 


M00001461A:D06 


583 


19 


89.Dl.sp6:130703.Seq 


M00001463C:B11 


C O A 

584 


38759 


89.Fl.sp6:130727.Seq 


M00001467A:B07 


cor 

585 


39508 


89.Gl.sp6:130739.Seq 


M00001467A:D04 


cor 

586 


16283 


89.Hl.sp6:130751.Seq 


M00001467A:D08 


COT 

587 


39442 


89.A2.sp6:130668.Seq 


M00001467A:E10 


coo 
588 


tc or* 

7589 


89.B2.sp6:130680.Seq 


M00001468A:F05 


coo 
589 




89.C2.sp6:130692.Seq 


M00001469A:A01 


590 


1 ^AO 1 

12081 


89.D2.sp6:130704.Seq 


M00001469A.C10 


Cfl 1 

591 


19105 


89JE2.sp6:130716.Seq 


M00001469A:H12 


592 


1037 


89.F2.sp6:130728.Seq 


M00001470A:B10 


f A'} 

593 


39425 


89.G2.sp6:130740.Seq 


M00001470A:C04 


Ct\ A 

594 


39478 


89.H2.sp6:130752.Seq 


M00001471A:B01 


595 




89.B3.sp6:130681.Seq 


M00001487B:H06 


596 




oV.C3.spo. 13U693.S>eq 


M00001488B:F12 


597 


18699 


sy.Lo.spo. 1 JU /UD.aeq 


M0000 1 490B :C04 


598 


7206 


89.E3.sp6:130717.Seq 


M00001494D:F06 


599 


2623 


89.F3.sp6:130729.Seq 


M00001497A:G02 
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BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 
SEQIDNO: Cluster ID 

600 10539 

601 5336 

602 2623 

603 9443 

604 9685 
605 

606 10185 

607 6974 

608 6420 
609 

610 

611 39168 

612 39412 

613 39186 

614 3956 
615 

616 14364 

617 40044 

618 8952 

619 35555 

620 18957 

621 8358 

622 38085 
623 

624 3990 

625 16921 

626 5321 

627 4119 

628 20212 

629 2696 

630 39392 

631 39420 

632 3389 

633 8286 

634 3765 

635 39453 
636 

637 

638 6974 
639 

640 19255 

641 1267 

642 5892 



Sequence Name 

89.G3.sp6:130741.Seq 

89.H3.sp6:130753.Seq 

89.A4.sp6:130670.Seq 

89.B4.sp6:130682.Seq 

89.C4.sp6:130694.Seq 

89.D4.sp6:130706.Seq 

89.E4.sp6: 13071 8.Seq 

89.F4.sp6:130730.Seq 

89.G4.sp6:130742.Seq 

89.H4.sp6:130754.Seq 

89.A5.sp6:130671.Seq 

89.B5.sp6:130683.Seq 

89.C5.sp6:130695.Seq 

89.D5.sp6:130707.Seq 

89.E5.sp6:130719.Seq 

89.F5.sp6: 13073 l.Seq 

89.G5.sp6:130743.Seq 

89.H5.sp6:130755.Seq 

89.A6.sp6:130672.Seq 

89.B6.sp6:130684.Seq 

89.C6.sp6:130696.Seq 

89.D6.sp6:130708.Seq 

89.E6.sp6:130720.Seq 

89.F6.sp6:130732.Seq 

89.G6.sp6:130744.Seq 

89.H6.sp6:130756.Seq 

89.B7.sp6:130685.Seq 

89.C7.sp6:130697.Seq 

89.E7.sp6: 13072 l.Seq 

89.F7.sp6:130733.Seq 

89.G7.sp6:130745.Seq 

89.H7.sp6:130757.Seq 

89.A8.sp6:130674.Seq 

89.B8.sp6:130686.Seq 

89.C8.sp6:130698.Seq 

89.E8.sp6:130722.Seq 

89.F8.sp6:130734.Seq 

89H8.sp6:130758.Seq 

89.A9.sp6:130675.Seq 

89.B9.sp6:130687.Seq 

89.C9.sp6:130699.Seq 

89.D9.sp6: 13071 l.Seq 

89.E9.sp6:l 30723 .Seq 
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PCTAJS98/27610 
Clone Name 
M00001499B:A1 1 
M00001500A:C05 
M00001500A:E11 
M00001500C:E04 
M00001501D:C02 
M00001504A:E01 
M00001504C:A07 
M00001504C:H06 
M00001504D:G06 
M00001505C:C05 
M00001506D:A09 
M00001507A:H05 
M00001511A:H06 
M00001512A:A09 
M00001512D:G09 
M00001513B:G03 
M00001513C:E08 
M00001514C:D11 
M00001518C:B11 
M00001528A:C04 
M00001528A:F09 
M00001528B:H04 
M00001531A:D01 
M00001531A:H11 
M00001532B:A06 
M00001534A:C04 
M00001534A:F09 
M00001534C:A01 
M00001535A:C06 
M00001536A:B07 
M00001536A:C08 
M00001537A:F12 
M00001537B:G07 
M00001540A:D06 
M00001541A:D02 
M00001542A:E06 
M00001542B:B01 
M00001544A:E06 
M00001544B:B07 
M00001545A:B02 
M00001545A:C03 
M00001546A:G11 
M00001548A:E10 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 
SEQ ID NO: Cluster ID 

643 4193 

644 16347 

645 7239 

646 5175 

647 22390 

648 3266 

649 5708 
650 

651 8298 

652 4573 

653 22814 

654 39539 

655 39195 

656 4561 

657 9244 

658 1577 

659 4386 

660 11294 

661 5192 

662 8761 
663 

664 7514 
665 

666 6558 

667 102 
668 

669 5749 

670 6539 

671 6293 
672 

673 260 

674 4837 

675 10470 

676 16999 

677 22794 

678 11465 

679 7802 

680 22155 
681 

682 13157 

683 17004 

684 40314 

685 40044 



Sequence Name 

89.G9.sp6:130747.Seq 

89.H9.sp6:130759.Seq 

89.A10.sp6:130676.Seq 

89.B10.sp6:130688.Seq 

89.C10.sp6:130700.Seq 

89.D10.sp6:130712.Seq 

89.E10.sp6:130724.Seq 

89.F10.sp6:130736.Seq 

89.G10.sp6:130748.Seq 

89.H10.sp6:130760.Seq 

89.All.sp6:130677.Seq 

89.Bll.sp6:130689.Seq 

89.Cll.sp6:130701.Seq 

89.Dll.sp6:130713.Seq 

89.Ell.sp6:130725.Seq 

89.Fll.sp6:130737.Seq 

89.Hll.sp6:l 30761. Seq 

89.A12.sp6:130678.Seq 

89.D12.sp6:130714.Seq 

89.E12.sp6:130726.Seq 

89.F12.sp6:130738.Seq 

89.G12.sp6:130750.Seq 

89. H12.sp6:130762.Seq 

90. Al.sp6:130859.Seq 
90.Bl.sp6: 130871. Seq 
90.Dl.sp6:130895.Seq 
90.El.sp6:130907.Seq 
90.Gl.sp6: 130931. Seq 
90.A2.sp6:130860.Seq 
90.C2.sp6:130884.Seq 
90.D2.sp6:130896.Seq 
90.E2.sp6:130908.Seq 
90.F2.sp6:130920.Seq 
90.G2.sp6:130932.Seq 
90.H2.sp6:130944.Seq 
90.A3.sp6: 130861. Seq 
90.B3.sp6: 130873. Seq 
90.C3.sp6:130885.Seq 
90.D3.sp6:130897.Seq 
90.E3.sp6:130909.Seq 
90.F3.sp6: 130921. Seq 
90.G3.sp6:130933.Seq 
90.H3.sp6:130945.Seq 
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PCT/US98/27610 
Clone Name 

M00001549B:F06 

M00001549C:E06 

M00001550A:A03 

M00001550A:G01 

M00001551A:G06 

MO0OO1551C:G09 

M00001552B:D04 

M00001552D:A01 

M00001553A:H06 

M00001553B.F12 

M00001553D:D10 

M00001555A:B02 

M00001555A:C01 

M00001555D:G10 

M00001556A:C09 

M00001556A:F11 

M00001556B:C08 

M00001556B:G02 

M00001557B:H10 

M00O01557D:DO9 

M00001558A:H05 

M00001558B:H11 

M00001559B:F01 

M00001560D:F10 

M00001563B:F06 

M00001566B-.D11 

M00001571C:H06 

M00001579D:C03 

M00001583D:A10 

M00001590B:F03 

M00001594B:H04 

M00001597C:H02 

M00001597DC05 

M00001598A:G03 

M00001601A:D08 

M00001607A:E11 

M00001608A:B03 

M00001608B:E03 

M00001608D:A11 

M00001614C:F10 

M00001617C:E02 

M00001619GF12 

M00001621C:C08 



BNSDOCID: <WO 9933982A2_I_> 




WO 99/33982 



SEQ ID NO: 


Cluster D3 


Sequence Name 


686 


13913 


90.A4.sp6:130862.Seq 


687 


3277 


90.B4.sp6: 1 30874.Seq 


688 


4309 


90.C4.sp6:130886.Seq 


689 


5214 


90.D4.sp6:130898.Seq 


690 




90.E4.sp6 : 1 3 09 1 0.Seq 


691 


39171 


90.F4.sp6: 1 30922.Seq 


692 


19267 


90 .G4 . sp6 : 1 3 093 4. Seq 


693 


4665 


90.H4.sp6: 1 3 0946.Seq 


694 




90.A5.sp6: 130863. Seq 


695 


23201 


90.B5.sp6: 130875. Seq 


696 


76760 


90.C5.sp6: 130887. Seq 


697 


23218 


90 .D5 .sp6: 1 30899. Seq 


698 


35702 


90.E5.sp6: 130911. Seq 


699 


6468 


90.F5.sp6:130923.Seq 


700 


14367 


90.G5.sp6:130935.Seq 


701 


7015 


90.H5.sp6:130947.Seq 


702 


8773 


90.A6.sp6:130864.Seq 


703 


11460 


90.B6.sp6:130876.Seq 


704 


7570 


90.D6.sp6:130900.Seq 


705 


4416 


90.E6.sp6:130912.Seq 


706 


6660 


90.F6.sp6:130924.Seq 


707 




90.H6.sp6: 1 30948.Seq 


708 


26875 


90.A7.sp6:130865.Seq 


709 


6298 


90.B7.sp6:130877.Seq 


710 


78091 


90.C7.sp6:130889.Seq 


711 


10751 


90.D7.sp6: 130901. Seq 


712 


10539 


90.F7.sp6:130925.Seq 


713 


17055 


90.G7.sp6: 1 30937.Seq 


714 


5382 


90.A8.sp6: 1 30866.Seq 


715 


4393 


90.B8.sp6: 1 30878.Seq 


716 


67252 


90.C8.sp6: 1 30890.Seq 


717 


40108 


90.D8.sp6: 130902.Seq 


718 


11476 


90.E8.sp6:130914.Seq 


719 




90.F8.sp6: 1 30926.Seq 


720 


697 


90.G8.sp6:l30938.Seq 


721 




90.H8.sp6: 13O950.Seq 


722 


17076 


90.A9.sp6: 1 30867. Seq 


723 


3108 


90.B9.sp6: 130879.Seq 


724 


67907 


90.C9.sp6: 1 30891 .Seq 


725 




90.D9.sp6:130903.Seq 


726 


11350 


90.F9.sp6:130927.Seq 


727 


7899 


90.H9.sp6:130951.Seq 


728 


7798 


90.A10.sp6:130868.Seq 
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^ PCT/US98/27610 
Clone Name 

M0OO01623D:F10 

M00001624A:B06 

M00001624C:F01 

M00001630B:H09 

M00001632D:H07 

M00001644C:B07 

M00001645A:C12 

M00001648C:A01 

M00001651A:H01 

M00001657D:C03 

M00001657D:F08 

M00001662C:A09 

M00001663A:E04 

M00001669B:F02 

M00001670C:H02 

MOOO01673C:H02 

M00001675A:C09 

M00001676B:F05 

MOOO01677D:A07 

M00001678D:F12 

M00001679A:A06 

M00001679A:F06 

M00O01679A:FlO 

M00001679B:F01 

M00001679C:F01 

M00001679D:D03 

M00001680D:F08 

M00001682C:B12 

M00001688C:F09 

M00001693C:G01 

M00001716D:H05 

M00O03741D:C09 

M00003747D:C05 

M00003754C:E09 

M00003759B:B09 

M00003761D:A09 

M00003762C:B08 

M00003763A:F06 

M00003774C:A03 

M00003784D:D12 

M00003826B:A06 

M00003837D:A01 

M00003839A:D08 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 
SEQ ID NO: Cluster ID 



729 


6539 


730 


6874 


731 




732 


13595 


733 


5619 


734 


10515 


735 


4622 


736 


3389 


737 


4718 


738 




739 


12977 


740 


8479 


741 




742 


7798 


743 


5345 


744 


31587 


745 


14507 


746 


13576 


747 




748 


9285 


749 


39809 


750 


16317 


751 


8672 


752 


12532 


753 


3900 


754 


23255 


755 


24488 


756 


40122 


757 


23210 


758 


23358 


759 


3430 


760 


2433 


761 


9105 


762 


6124 


763 


40073 


764 


37285 


765 


17036 


766 


3706 


767 




768 


15069 


769 


9285 


770 


6880 


771 


5325 



Sequence Name 

90.B10.sp6:130880.Seq 

90.C10.sp6:130892.Seq 

90.D10.sp6:130904.Seq 

90.E10.sp6:130916.Seq 

90.F10.sp6:130928.Seq 

90.G10.sp6:130940.Seq 

90.H10.sp6:130952.Seq 

90.All.sp6:130869.Seq 

90.Bll.sp6:130881.Seq 

90.C1 l.sp6:l 30893. Seq 

90.Fll.sp6:130929.Seq 

90.Gll.sp6: 130941. Seq 

90.Hll.sp6: 130953. Seq 

90.A12.sp6:130870.Seq 

90.B12.sp6:130882.Seq 

90.C12.sp6:130894.Seq 

90.D12.sp6:130906.Seq 

9O.F12.sp6:130930.Seq 

90.G12.sp6:130942.Seq 

90.H12.sp6:130954.Seq 

99.Al.sp6:131230.Seq 

99.Bl.sp6:131242.Seq 

99.Cl.sp6:131254.Seq 

99.Dl.sp6:131266.Seq 

99.El.sp6:131278.Seq 

99.Fl.sp6:131290.Seq 

99.C2.sp6:131255.Seq 

99.D2.sp6:131267.Seq 

99.E2.sp6:131279.Seq 

99.F2.sp6: 13 1291. Seq 

99.A3.sp6:131232.Seq 

99.B3.sp6:131244.Seq 

99.C3.sp6:131256.Seq 

99.D3.sp6:131268.Seq 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

99.A4.sp6:131233.Seq 

99.C4.sp6:131257.Seq 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

99.H4.sp6:131317.Seq 

99.A5.sp6:131234.Seq 

99.C5.sp6:131258.Seq 

158 



PCT/US98/27610 
Clone Name 

M00003844C:BI 1 

M00003846B:D06 

M00003851B:D08 

M00003851B.D10 

M00003853A:D04 

M00003853A:F12 

M00003856B:C02 

M00003857A:G10 

M00003857A:H03 

M00003867A:D10 

M00003875B:F04 

M00003875C:G07 

M00003875D:DI1 

M00003876D:E12 

M00003879B:C11 

M00003879B:D10 

M00003879D:A02 

M00003885C:A02 

M00003891C:H09 

M00003906C:E10 

M00003907D:A09 

M00003907D:H04 

M00003909D:C03 

M00003912B:D01 

M00003914C:F05 

M00003922A:E06 

M00003968B:F06 

M00003970C:B09 

M00003974D:E07 

M00003974D:H02 

M00003981A:E10 

M00003982C:C02 

M00003983A:A05 

M00004028D:A06 

M00004028D:C05 

M00004035C:A07 

M00004035D:B06 

M00004068B:A01 

M00004072A:C03 

M00004081C:D10 

M00004086D:G06 

M00004087D:A01 

M00004093D:B12 



BNSDOCID; <WO 9933982A2_L> 




PCT/US98/27610 



SEQ ID NO: 


Cluster ID 


Sequence Name 


Clone Name 


772 


7221 


99.D5.sp6:131270.Seq 


M00004 1 05C: A04 


773 


4937 


99.E5.sp6:131282.Seq 


m M f\f\r\f\ A 1 AO A . IT A/C 

M00004 1 08 A: bUo 


774 


6874 


99.F5.sp6:131294.Seq 


» m (\f\f\f\A ill n\. A AO 

1V100004I I lL):AUo 


775 


13183 


99.G5.sp6:131306.Seq 


X >f AAA A/I 1 1 Af^.TZI 1 

M00U041 14C:r 1 1 


776 




99.H5.sp6:131318.Seq 


m m A AAA A lllD.rAl 

M00004121B:OU1 


777 


13272 


99.A6.sp6: 13 1235. Seq 


X>f AAA A /I HOD ."LJAO 

MUUUU4 1 Jot>:HUz 


778 


5257 


99.B6.sp6: 1 3 1 247. Seq 


lV/fAAAAyl 1 AiLf^'C^ \ 1 


779 


6455 


99.D6.sp6: 1 3 1 27 1 .Seq 


\A AAAA/1 1 Zir*. A AO 

MUUUU4 1 D /U.AUy 


780 


5319 


99.E6.sp6: 13 1283. Seq 


IV /f AAA A A 1 /CfkP .ff) 

M00004 1 69C:C 1 2 


781 


4908 


99.F6.sp6:131295.Seq 


X>f A AAA /I 171 FA. QAl 

M00004 1 7 1 D: r>Oi 


782 


11494 


99.G6.sp6:131307.Seq 


x>f aaaa/1 i lor'.nnc 


783 


11443 


99.A7.sp6:131236.Seq 


M00004185C:Cu3 


784 




99.B7.sp6:131248.Seq 


M00004191D:B1 1 


785 


8210 


99.C7.sp6:131260.Seq 


M00004 1 97D:H0 1 


786 


14311 


99.D7.sp6:131272.Seq 


» A A AAA A T~i . /"* 1 O 

M00004203B:C 1 2 


787 




99.E7.sp6:131284.Seq 


M00004205D:F06 


788 


12971 


99.B8.sp6:131249.Seq 


M00004223D:b04 


789 


6455 


99.C8.sp6: 13 1261. Seq 


M00004229B:r08 


790 


7212 


99.D8.sp6:131273.Seq 


m jb /\r\r\/\ A^s^y ATI . PAH 

M00004230B:C07 


791 


4905 


99.H8.sp6: 13 1321. Seq 


M00004269D:D0o 


792 


16914 


99.A9.sp6:131238.Seq 


\ lAAAA/tlTCA.P 1 1 

M00004275C:C1 1 


793 


16921 


99.D9.sp6:131274.Seq 


■fc iff A /\ A A >11Af A.C 1 O 

M00004295D:r 12 


794 


13046 


99.E9.sp6:131286.Seq 


» f AAA A >1 O AZr/*^ . T IA*7 

M00004296C:H07 


795 


9457 


99.F9.sp6:131298.Seq 


lk 1AAAAX "> /\ "7/^ . * A^ 

M00004307C:A0o 


796 


26295 


99.G9.sp6:131310.Seq 


M000043 1 2A:Cj03 


797 


21847 


99.H9.sp6:131322.Seq 


IV vf A AAA/1 1 i Qp.ni n 

MUUUU43 1 ot.Ul U 


798 




99.H10.sp6: 13 1323. Seq 




799 




99.B1 l.sp6:131252.Seq 


X>f AAAA/I^QO A *T-inft 


800 




99.D1 l.sp6:131276.Seq 


x/iAAAAc i onr* - 


801 


39304 


RTA00000 1 1 8A.J.2 1 . 1 .SeqJTHC 1 5 1 859 




802 


2428 


RTA00000 1 23 A.1.2 1 . 1 .Seq_THC205063 




803 


1058 


RTAOO0OO 1 26A.e.20.3 .Seq_THC2 1 7534 




804 


5097 


RTA 00000 1 34A.k. 1 . 1 .Seq_THC2 1 5 869 




805 


20212 


RTA00000 134A.1.22. 1 .Seq_THC 1 28232 




806 


23255 


RTA00000 1 77AF.e. 1 4.3 .Seq_THC228776 




807 


2790 


RTA00000177AF.e.2.1.Seq_THC229461 




808 


6420 


RTA00000 1 77 AFX 1 0.3 .Seq_THC226443 




809 


4059 


RTA00000177AF.n.l 8.3. Seq_THC 123051 




810 




RTA00000 1 79AF.J .13.1 .Seq_THC 1 05720 




811 


9952 


RTAOOOOO 1 80AF.C.20. 1 .SeqJTHC 1 62284 




812 


13238 


RTA0O0OO 1 8 1 AF.m.4. 1 .Seq_THC 1 4069 1 




813 


9685 


RTAOOOOO 1 83 AF.c. 11.1 .Seq_THC 1 09544 




814 




RTAOOOOO 1 83 AF.c.24. 1 .SeqJTHC 125912 





BNSDOCID: <WO 9933982A2J_> 





• 




WO 99/33982 




oH>v<l ID iNO; 


Cluster ID 


otHjuence in a me 


8 ! 5 


6420 


Jv 1 /AUUUUU I ojnr .u. 1 1 . i -oeCJ_ I HC22644-) 


016 


6974 


Jv 1 auuuuu 1 0 J Ar .Q.y. 1 .oCC] I HC223 J 29 


C 1 *7 
O I / 


A f\(\A A 

4UU44 


iv 1 auuuuu I oJAr .g.Zz. 1 .beq_ 1 HC232899 


olo 




iv 1 /\uuuuu i oiAr .g.9. 1 .oeq_ 1 HC 1 98280 


819 


5892 


iv 1 nuuuvu j 0*4 Ar .a. 1 1 . 1 .oCC] 1 HC J 0 1 07O 


o2U 


A f\(\A A 

40044 


xv 1 /\uuuuu 1 ooAr .a. 1 . 1 .i>eq_ 1 HC232o9y 


82 1 




iv 1 auuuuu 1 oOAr .n. 1 4. 1 .oeci_ 1 HC J 1 2 j2 j 


822 


19267 


rta nnnnn 1 a ft 1 10 1 tuphoicj 

iv 1 avuuuuu I oOAr.l. 1 Z. 1 .oeq__ 1 HC I /o 1 83 


cot 
oz 3 


8773 


iv 1 auuuuu 1 0 /Ar .1.24. 1 .beq_ 1 HC220002 


A 


*7 C "7 A 

7570 


iv 1 auuuuu 1 0 /Ar .g.24. l .oeq_I HC 1 68636 


825 


1 1476 


iv i r\\j\j\j\j\J 1 O / /\r .p. 1 y. 1 .oCC] 1 HC 1 Uo4o2 


826 




RTAOOnnni 88 AP A 11 i Qai Tunnno/i 
iv i auuuuu i ooAr .a. 1 1.1 -^CCL * HC2 1 2U94 


827 


17076 


k 1 auuuuu 1 ooAr ,a.2 1 . 1 .beq_ I HC208760 


0 0 0 
828 


697 


iv 1 auuuuu 1 ooAr .a.o. 1 .beq_ 1 HC 1 78884 


829 


67907 


k 1 auuuuu 1 ooAr .g. 1 1 . 1 .beq_ IriC 1 23222 


830 


5619 


iv 1 auuuuu i ooAr.I.V. 1 .oeq__ 1 HC 1 6 /845 


831 


4718 


k i auuuuu I oVAr . 1 .Seq_ I He 1 96 1 02 


832 


39809 


RTAnrtflfiniOAAC ^ ^ 1 c a.*. TUPunin 

iv 1 auuuuu 1 yuAr .e. J . J .oeq 1 HC 1 502 1 7 


833 


23255 


dta nnnAA 1 OA a it : a 1 c^.^. tt r/^noiT/'' 

iv 1 auuuuu 1 yUAr j.4. 1 .beq_ 1 HC228776 


834 


40122 


rxa nnnnn 1 on a « 01 1 c fl n TupiAmoi 
iv 1 /\uuuuu 1 yuAr .n.ZJ . 1 .becj_ i HC I U922 / 


835 


23210 


iv 1 auuuuu I yuAr .o.ZU. 1 .beq_ i HC207240 


836 


23358 


RTAnnnnnionAc a 01 1 c^^, Tur-^A^/iA 
iv i auuuuu 1 yUAr .o.z I . J .oeq_ 1 HC207240 


837 


5693 


RTAonnnni on apt n n i c^^. tup 1 m 1 0 
iv 1 /\k)\j\j\j\j 1 yuAr .p. 1 /.2.lSeq_ 1 HC 1 733 1 8 


0 1 0 
838 


2433 


iv i auuuuu J y 1 Ar .a. 1 j.2.oecj_ 1 HC79498 


839 


5257 


iv 1 auuuuu I yzAr .1. J . 1 .oeq 1 HC2 13833 


840 


16392 


RTAnnnnniooAc 1 i 1 c^.^. Tunmmi 
iv 1 auuuuu i yzAr .1. 1 . i 1 HC20207 1 


84 I 




K i AUUUUU 1 yj Ar x.2 1 . 1 .Seq__THC222602 


842 


26295 


iv i auuuuu l yj Ar .i.24.2.beq_ 1 HC 1 97345 


843 




r t a nnnnn i a p m ^ i tup i n 110 
iv i jwjww 1 y j Ar .m. j . 1 .oecL l HC 1 155 1 8 


844 




RTA000001 93 AF.n. 15.1 .Seq_THC2 1 5687 



PCT/US98/27610 
Clone Name 
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Table 2 





Nearest 

(BlastN vs. 
Genbank) 






Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


1 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


2 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


3 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


4 


<NONE> 


<NONE> 


<NONE> 


BAR3_CHITE 


BALBIANI RING 
PROTEIN 3 
PRECURSOR>PI R2 :S 
08167 Balbiani ring 3 
protein - midge 
(Chironomus 
tentans)>GP:CTBR3_l 
C;tentans balbiani ring 
3 (BR3) gene 


1 


5 


<NONE> 


<NONE> 


<NONE> 


CYAA PODA 
N 


ADENYLATE 
CYCLASE (EC 
4.6.1.1) (ATP 
PYROPHOSPHATE- ! 
LYASE) (ADENYLYL 
CYCLASE)>PIR2:JC4 
747 adenvlate cvclase 
(EC4.6.1.1)- 
Podospora 

anserina>GP:PANADC 
Y_l Podospora 
anserina adenyl cyclase 
gene, exons 1-4 


1 


6 


<NONE> 


<NONE> 


<NONE> 


VP03_HSVSA 


PROBABLE 
MEMBRANE 
ANTIGEN 3 
(TEGUMENT 
PROTEIN)>PIR2:C36 
806 hypothetical 
protein ORF3 - 
saimiriine herpesvirus 1 
(strain 

11)>GP:HSGEND_3 
Herpesvirus saimiri 
complete genome 
DNA; ORF 03; 
similarity to ORF 75 
and EBVBNRF1 


0.97 \ 



161 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 




Table 2 





i Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


7 


<NONE> 


<NONE> 


<NONE> 


ATFCA2J8 


Arabidopsis thaiiana 
DNA chromosome 4, 
ESSA 1 contig 
fragment No; 2; 
Hydroxyproline-rich 
glycoprotein homolog; 
Similarity to 
hydroxyproline-rich 
glycoprotein precursor- 
common tobacco 


0.93 


8 


<NONE> 


<NONE> 


<NONE> 


DHAL ASPN 
G 


ALDEHYDE 
DEHYDROGENASE 
(EC 1.2.1.3) 
(ALDDH)>GP:ASNA 
LDAA1 Aspergillus 
niger aldehyde 
dehydrogenase (aldA) 
gene, complete cds 


0.9 


9 


<iNUNb> 


<NONE> 


<NONE> 


NCU50264_1 


Neurospora crassa two- 
component histidine 
kinase (nik- 1 ) gene, 5* 
region and partial cds 


0.86 


10 


<NONE> 


<NONE> 


<NONE> 


NEUG BOVI 
N 


NEUROGRANIN 
(PI 7) (B-50 
IMMUNORE A CTI VE 
C-KINASE 
SUBSTRATE) 
(BICKS) 

(FRAGMENT)>PIR2: 
A39034 neurogranin - 
bovine (fragment) 


0.82 


11 


<NONE> 


<NONE> 


<NONE> 


HUMBYSTIN 
J 


Homo sapiens bystin 
mRNA, complete cds 


0.81 


12 


<NONE> 


<NONE> 


<NONE> 


BTBMP1J 


Bos taurus BMP1 gene, 
partial sequence; Bone 
morphogenetic protein 
1 


0.69 


13 


<NONE> 


<NONE> 


<NONE> 


TCCYSPROT 
_1 


T;congolense mRNA 
for (prepro) cysteine 
proteinase 


0.56 


14 


<NONE> 


<NONE> 


<NONE> 


P60_LISIV 


PROTEIN P60 
PRECURSOR 
(INVASION- 
ASSOCIATED 
PROTEIN)>GP:LISIA 
PRELBJ Listeria 


0.15 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIF1ION 


P 

VALUE 












ivanovii extracellular 
protein homologue 
(iap) gene, complete 
cds 




15 


<NONE> 


<NONE> 


<NONE> 


HEXADE3 1 


HEXON PROTEIN 
(LATE PROTEIN 2) 
(FRAGMENT)>PIR2: 
S3721 7 hexon protein - 
human adenovirus 31 
(fragment)>GP:HSAT3 
1H1 H;sapiens 
adenovirus type 31 
hexon gene; Hexon 
protein; Internal 
fragment containing 
hypervariable regions 


0.15 


16 


<NONE> 


<NONE> 


<NONE> 


HSU77493J 


Human Notch2 mRNA, 
partial cds; 
Transmembrane 
protein; hN 


0.13 


17 


<NONE> 


<NONE> 


<NONE> 


CYB_PARTE 


CYTOCHROME B 
(bL 

1.10.2.2)>PIR2:S07743 
cytochrome b - 
Paramecium tetraurelia 
mitochondrion 
(SGC6)>GP:MIPAGE 
N_19 Paramecium 
aurelia mitochondrial 
complete genome; 
Apocytochrome b (AA 
1-391) 


0.078 


18 


<NONE> 


<NONE> 


<NONE> 


HUMERB27_ 
1 


Human c-erbB-2 gene, 
exon 7; C-erb-2 protein 


0.054 


19 


<NONE> 


<NONE> 


<NONE> 


DMTRXIII_2 


D;melanogaster DNA 
for trxl and trxll genes; 
Trithorax protein trxl; 
Trithorax; 

putative>GP:DMTTHO 
RAXJ2 

D;melanogaster DNA 
for (putative) trithorax 
protein; Predicted 
trithorax protein 


0.047 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


20 


<NONE> 


<NONE> 


<NONE> 


CELB0281_5 


Caenorhabditis elegans 
cosmid B0281; Similar 
to reverse transcriptases 


0.043 


21 


<NONE> 


<NONE> 


<NONE> 


MOTY VIBP 
A 


SODIUM-TYPE 
FLAGELLAR 
PROTEIN MOTY 
PRECURSOR>GP:VP 
U06949_4 Vibrio 
parahemolyticus 
BB22 RNase T (rnt) 
gene and flagellar 
motor component 
(motY) gene, complete 
cds 


0.041 


22 


<NONE> 


<NONE> 


<NONE> 


A56263 


hptfi-crfi laptop iH»Qf» tV^C^ 

3.2.1.23) isozyme 12- 
Arthrobacter sp. (strain 
B7)>GP:ASU17417_1 
Arthrobacter sp; beta- 
galactosidase gene, 
complete cds 




23 


<NONE> 


<NONE> 


<NONE> 


GSA_PSEAE 


GLUTAMATE-] - 
SEMIALDEHYDE 
2,1-AMINOMUTASE 
(EC 5.4.3.8) (GSA) 
(GLUTAMATE- 1 - 
SEMIALDEHYDE 
AMINOTRANSFERA 
SE) (GSA- 
AT)>PIR2:S57898 
glutamate 1- 
semialdehyde 2,1- 
aminomutase - 
Pseudomonas 
aeruginosa>GP:PAHE 
ML_1 P;aeruginosa 
hemL gene; Glutamate 
1-sem 


0.038 


24 1 


<NONE> 


<NONE> 


<NONE> 


SI 6323 


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHB 1_1 
A ; thai i ana homeobox 
gene Athb-1 mRNA; 
Open reading frame | 


0.035 1 
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Nearest 

Neighbor ! 

(BlastN vs. 

Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEC 
ID 


> 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


25 


<NONE> 


<NONE> 


<NONE> 


IRS1JIAT 


INSULIN RECEPTOR 
SUBSTRATE- 
1>PIR2:S 16948 
hypothetical protein 
IRS-1 - 

rat>GP:RNIRSHRMJ 
R;Norvegicus IRS-1 
mRNA for insulin- 
receptor; During insulin 
stimulation, undergoes 
tyrosine 

phosphorylation and 
binds 

phosphatidyl inositol 3- 
kinase 


0.027 


26 


<NONE> 


<NONE> 


<NONE> 


CEM02G9_2 


Caenorhabditis eiegans 
cosmid M02G9; 
M02G9;1; Similar to 
keratin like protein; 
cDNA EST 
yk308gl 1;5 comes 
from this gene; cDNA 
EST yk208ell;5 comes 
from this gene; cDNA 
EST yk208ell;3 comes 


0.0088 


27 


<NONE> 


<NONE> 


<NONE> 


S75490_3 


competence region: 
iga=IgA protease, 
comA=transformation 
competence [Neisseria 
gonorrhoeae, MSI 1, 
Genomic, 3 genes, 
2664 nt] 


0.0041 


28 


<NONE> 


<NONE> 


<NONE> 


EXTNJTOBA 


EXTENSIN 

i rvCL- vj r\jUJ\ ^^CL^JL. 

WALL 

HYDROXYPROLINE- 
RICH 

GLYCOPROTElN)>PI 
R2:S06733 
hydroxyproline-rich 
glycoprotein precursor 
- common 

tobacco>GP:NTEXT_l 
Tobacco HRGPnt3 
gene for extensin; 
Extensin (A A 1-620) 


0.0025 
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: :/:v|f; : 


| Nearest 
| Neighbor 
\ (BlastN vs. 
1 Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


! p ; 

VALUE 


29 


<NONE> 


<NONE> 


<NONE> 


HPCEGSJ 


Hepatitis C virus 
complete genome 
sequence; Polyprotein 


0.0014 


30 


<NONE> 


<NONE> 


<NONE> 


HHVBC_4 


Human hepatitis virus 
(genotype C, HMA) 
preSl,preS2, S, C, X, 
antigens, core antigen, 
X protein and 
polymerase 


0.00093 


31 


<NONE> 


<NONE> 


<NONE> 


HSLTGFBP4 
1 


Homo sapiens mRNA 
for latent transforming 
growth factor-beta 
binding protein-4; 
Latent TGF-beta 
binding protein-4 


0.00061 


32 


1 <NONE> 


<NONE> 


<NONE> 


S74909 


transposase - 
Synechocystis sp. (PCC 
6803)>GP:D90909_10 
8 Synechocystis sp; 
PCC6803 complete 
genome, 11/27, 
1311235- 1430418; 
Transposase; 
ORFJD:slr2062 


0.00051 


33 


<NONE> 


<NONE> 


<NONE> 


GRN MOUS 
E 


GRANULINS 
PRECURSOR 
(ACROGRANIN)>GP: 
MUSAPl Mouse gene 
for acrogranin 
precursor, complete cds 


0.00022 


34 


<NONE> 


<NONE> 


<NONE> 


CA21 MOUS 
E 


PROCOLLAGEN 
ALPHA 2(1) CHAIN 
PRECURSOR>PIR2:A 
43291 collagen alpha 
2(1) chain precursor - 
mouse>GP:MMCOL 1 
A2_l Mouse COL 1A2 
mRNA for pro-alpha- 
2(1) collagen 


0.00016 


35 


<NONE> 


<NONE> 


<NONE> 


MMMHC29N 
7_2 


Mus muscuius major 
histocompatibility 
locus class III 
region:butyrophiIin-like 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 


8.00E-05 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












alpha, palmitoyl- 




36 




vviN 




NFH RAT 

INI I 1 I v/i 1 


NEUROFILAMENT 
TRIPLET H PROTEIN 
(200 KD 

NEUROFILAMENT 
PROTEIN) (NF-H) 
(FRAGMENT) 


2.40E-05 


37 


<NONE> 


<NONE> 


<NONE> 


HUMVWFM 
1 


Human von Willebrand 
factor mRNA, 3' end; 
Von Willebrand factor 
prepropeptide 


1.70E-05 


38 


<NONE> 


<NONE> 


<NONE> 


CGHU2E 


collagen alpha 2(XI) 
chain - human 
(fragment) 


2.00E-06 


39 


<NONE> 


<NONE> 


<NONE> 


A6U83 


hypothetical protein 
(sdsB region) - 
Pseudomonas sp. 


4.90E-08 


40 


<NONE> 


<NONE> 


<NONE> 


YM8L YEAS 
T 


HYPOTHETICAL 71.1 
KD PROTEIN IN 
DSK2-CAT8 
INTERGENIC 
REGION>PIR2:S5458 
5 hypothetical protein 
YMR278w - yeast 
(Saccharomyces 
cerevisiae)>GP:SC802 
1X4 S;cerevisiae 
chromosome XIII 
cosmid 8021 ; 
Unknown; YM8021;04, 
unknown, len: 622, 
CAI:0;16, 


1.50E-09 


41 


<NONE> 


<NONE> 


<NONE> 


MTCY2 10_31 


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


3.10E-10 
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Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 






Nearest 
i^eignDor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


42 


<NONE> 


<NONE> 


<NONE> 


CEC01G10__5 


Caenorhabditis elegans 
cosmid C01G10, 
complete sequence; 
C01G10;8;CDNA EST 
CEMSC45R comes 
from this 

gene>GP:CEC01G10_ 
5 Caenorhabditis 
elegans cosmid 
C01G10; C01G10;8; 
CDNA EST 
CEMSC45R comes 
from this gene 


2.30E-12 


43 


<NONE> 


<NONE> 


<NONE> 


HSU15779_1 


Human p70 (ST5) 
mRNA, alternatively 
spliced, complete cds; 
Differentially 
expressed; alternatively 
spliced 


9.50E-14 






<NONt> 


<NONE> 


MTCY210_31 


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


1.70E-17 


45 


U61403 


Dictyostelium 
discoideum PrlA 
(prlA) mRNA, 
partial cds. 


1 


U93472_l 


Danio rerio PPARB 
gene, partial cds; 
Nuclear receptor C 
domain 


0.95 






Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from done 
F31D4; HTGS 
phase 1 . 


l 


U93472_l 


Danio rerio PPARB 
gene, partial cds; 
Nuclear receptor C 
domain 


0.94 


47 


L36557 


Oryza sativa 
(clone pRG3) 
repetitive 
element. 


I 


HSU61262J 


Human neogenin 
mRNA, complete cds 


0.89 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIKIIUN 


P 

VALUE 


48 


AF005898 


Homo sapiens 

Na,K-ATPase 

beta-3 subunit 

pseudogene, 

complete 

sequence. 


1 


LRP1_CHICK 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 
1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) 
(A2MR)>PIR2:A53102 
LDL receptor-related 
protein / alpha-2- 
macroglobulin receptor 
precursor - 

chicken>GP:GGLRPA 
2MR 1 G;gallus 
mRNA for LRP/alp 


0.85 


49 


U 18795 


Saccharomyces 
cerevisiae 
chromosome V 
cosmids 9669, 
8334,8199, and 
lambda clone 
1160. 


1 


NKC1 SQUA 
C 


BUMETANIDE- 
SENSITIVE SODIUM- 
(POTASSIUM)- 
CHLOR1DE 
COTRANSPORTER 2 
(NA-K-CL 

SYMPORTER)>PIR2: 
A53491 bumetanide- 
sensitive Na-K-C 1 
cotransporter - spiny 
dogfish>GP:SANKCC 
1 1 Saualus acanthias 
bumetanide-sensitive 
Na-K-Cl cotransport 
protein (NKCC 


0.73 


50 


AC002523 


Homo sapiens; 
HTGS phase 1, 
54 unordered 
pieces. 


1 


BXEN CLOB 

! o 


BOTULINUM 
NEUROTOXIN TYPE 
E, NONTOXIC 
COMPONENT>GP:C 
LOENT120J 
C;botulinum gene for 
nontoxic component of 
progenitor toxin, 
complete cds 


0.71 
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1 Nearest 
■ Neighbor 
|( Blast N vs. 
1 Gen bank) 






Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


(accession 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


51 


1 AC002345 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Human i /; 
HTGS phase 1, 
1 0 unordered 
pieces. 


1 


P3K2_D1CDI 


PHOSPHATIDYLINO 
SITOL 3-KINASE2 
(EC 2.7. 1.1 3 7) (PI3- 
KINASE) (PTDINS-3- 
KINASE) 

(PI3K)>GP:DDU23477 
_1 Dictyosteiium 
discoideum 
phosphatidyl inositol- 
4,5-diphosphate 3- 
kinase (PIK2) mRNA, 
complete cds 


0.58 j 




VI zio^l 


Human mKNA 
for cripto protein. 


1 


155651 


noradrenaline 
transporter - 
bovine>GP:BTU09 1 98 

1 Bos taurus 
noradrenaline 
transporter mRNA, 
complete cds 


0.55 I 


53 

54 1 


U23516 


Caenorhabditis 
elegans cosmid 
B0416. 


1 


169024 


MHC sex-limited 
protein - mouse 
(fragment)>GP:MUSM 
HC4AD_1 Mouse class 
III H2-Slp sex-limited 
protein gene, exons 1, 2 
and 3; MHC sex- 
limited protein 


0.47 J 


55 1 


AB006698 


Arabidopsis 
thaliana genomic 

chromosome 5, 
PI clone: 
MCL19. 


1 


S81293J 


LI {insertion sequence, 
provirus} [human 
papillomavirus type 6b 
HPV6b, KP4, Genomic 
Mutant, 121 nt]; 
Authors note this 
reading frame results 
from a 454 bp deletion 
and resulting 


0.25 1 




K03458 


Human 

immunodeficienc 
y virus type 1, 
isolate Zaire 6, 
vif, tat, rev, env, 
nef genes and 3 f 
LTR. 


] 


S13383 


h vrl rny vr*m 1 in r\ r» H 

glycoprotein - sorghum 


n oa 1 
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Nearest 
Neighbor 
(BiastX vs. 
Non- 
Redundant 
Proteins) 



SEQ 
ID 



ACCESSION 



56 



B26794 



T1016TR TAMU 
Arabidopsis 
thaliana genomic 
clone TIO 16. 



DESCRIPTION 



P 

VALUE 



ACCESSION 



1 



RK34JPORP 
U 



CHLOROPLAST 50S 

RIBOSOMAL 

PROTEIN 

L34>PIR2:S73111 

ribosomal protein L34 - 

red alga (Porphyra 

purpurea) 

chloroplast>GP:PPU38 
804_4 Porphyra 
purpurea chloroplast 
genome, complete 
sequence; 50S 
ribosomal protein L34 



DESCRIPTION 



P 

VALUE 



57 



Z98950 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
507115; HTGS 
phase 1. 



D4U32 



collagen-related protein 

- Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp.>GP:HSNCOL4_l 
Hydra N-COL 4 
mRNA for mini- 
collagen; No start 
codon 



58 



U57057 



Human WD 
protein IR10 
mRNA, complete 
cds. 



DMU 15602 1 



Drosophila 
melanogaster (zeste- 
white 4) mRNA, 
complete cds; Similar 
to C; elegans B0464;4 
gene product, Swiss- 
Prot Accession Number 
Q03562 



59 



U57057 



Human WD 
protein IR10 
mRNA, complete 
cds. 



CR2 MOUSE 



COMPLEMENT 
RECEPTOR TYPE 2 
PRECURSOR (CR2) 
(COMPLEMENT C3D 
RECEPTOR)>PIR2:A4 
3526 complement 
C3d/Epstein-Barr virus 
receptor 2 precursor - 
mouse>GP:MUSCR2A 
A_l Murine 
complement receptor 
type 2 (CR2) mRNA, 
complete cds; 
Complement receptor 
type 
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Nearest 
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(BlastX vs. 
Non- 
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Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 


ACCESSION 


DESCRIPTION 


P 

VALUE 


60 


B65337 


CIT-HSP- 
2021H21.TF 
CIT-HSP Homo 
sapiens genomic 

o| nnP OOO 1 H9 1 

cione zuz i fiz i . 


1 


A3 8096 


perlecan precursor - 
human>GP:HUMHSP 
G2B1 Human heparan 
sulfate proteoglycan 
(HSPG2) mRNA, 
complete cds 


0.0051 


61 


U84722 


Human vascular 
endothelial 
cadherin mRNA, 
complete cds. 


1 


HSTAFII13J 


H;sapiens mRNA for 
TAFII135;Subunit of 
RNA polymerase 11 
transcription factor 
TFIID 


0.0012 


62 


L41493 


Avian rotavirus 
(strain turkey 1) 
genomic segment 
4 outer capsid 
protein (VP8*) 
gene. 


1 


Y328 MYCP 
N 


HYPOTHETICAL 
PROTEIN MG328 
HOMOLOG>PIR2:S73 
693 MG328 homolog 
P01_orfl033 - 
Mycoplasma 
pneumoniae (ATCC 
29342) 

(SGC3)>GP:MPAE000 
035_2 Mycoplasma 
pneumoniae from bases 
442306 to 452472 
(section 35 of 63) of 
the complete genome; 
MG328 homolog, 


0.00015 


63 


D63139 


Aeromonas sp. 
gene for 
chitinase, 
complete and 
partial cds. 


1 


MTCY16B7_3 


Mycobacterium 
tuberculosis cosmid 
SCY16B7; Unknown; 

1V1 1 I IOd /,Uj, 

initiation factor, len: 
900, similar at C- 
terminal half to eg 
IF2_BACSUP17889 
initiation factor if-2 
(716 aa), fasta 


6.30E-05 
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SEQ 
ID ! 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


64 


J04974 


Human alpha-2 
type XI collagen 
mRNA 
(COL11A2). 


1 


GDF6 BOVI 
N 


GROWTH/DIFFEREN 
TIATION FACTOR 
GDF-6 PRECURSOR 
(CARTILAGE- 
DERIVED 
MORPHOGENETIC 
PROTEIN 2) (CDMP- 
2) 

(FRAGMENT)>PIR2: 
B55452 cartilage- 
derived morphogenetic 
protein 2 precursor - 
bovine 

(fragment)>GP:BTU 1 3 
661_1 Bostaurus 
cartilage-derived morp 


1 .0OE-05 














65 


AC002394 


Homo sapiens 
Chromosome 16 
BAC clone 
CIT987-SKA- 
21 1C6 -complete 
genomic 
sequence, 
complete 
sequence. 


i ; 


CELC14F1 1 
6 


Caenorhabditis elegans 
cosmidC14F11; 
Similar to aspartate 
aminotransferase; 
coded for by C; elegans 
cDNA CEMSF95FB; 
coded for by C; elegans 
cDNA yk41e4;3; coded 
for by C; elegans 


4.60E-06 


66 


AB002312 


Human mRNA 
for KIAA0314 
gene, partial cds. 


i 


NAT1 YEAS 
T 


N-TERMINAL 
ACETYLTRANSFER 
ASE 1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 
ASE 1) 


1 .OOE-09 


67 


AC003085 


Human BAC 
clone RG094H21 
from 7q21-q22, 
complete 
sequence. 


i 


DP19_CAEEL 


DPY-19 

PROTEIN>PIR2:S446 
29 f22b7. 10 protein - 
Caenorhabditis 
elegans>GP:CELF22B 
7_9 C;aenorhabditis 
elegans (Bristol N2) 
cosmid F22B7; 
Putative 


4.20E-11 


68 


X55026 


P.anserina 
complete 
mitochondrial 
genome. 


i 


NAT1 YEAS 
T 


N-TERMINAL 
ACETYLTRANSFER 
ASE 1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 


8.40E-12 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












ASE 1) 




69 


Z95399 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39B6; HTGS 
phase 1. 


1 


CER06B9_5 


Caenorhabditis elegans 
cosmid R06B9, 
complete sequence; 
R06B9;b; Protein 
predicted using 
Genefinder; 
preliminary prediction 


1.50E-24 


70 


AC002339 


Arabidopsis 

thaliana 

chromosome II 

BACT11A07 

genomic 

sequence, 

complete 

sequence. 


0.99 


POLG BVDV 
S 


GENOME 

POLYPROTEIN>PIRl 
:A44217 genome 
polyprotein - bovine 
viral diarrhea virus 
(strain SD- 

l)>GP:BVDPOLYPR 
0_1 Bovine viral 
diarrhea virus 
polyprotein RNA, 
complete cds; Putative 


1 


71 


Y08559 


B.subtilis urease 
operon and 
downstream 
DNA. 


0.99 


LRPCAEEL 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 
PRECURSOR 
(LRP)>PIR2:A47437 
LDL-receptor-related 
protein - 
Caenorhabditis 
elegans>GP:CEF29D 1 
1_2 Caenorhabditis 
elegans cosmid 
F29D1K complete 
sequence; F29D11;1; 
Protein predicted using 
Genefl 


1 



174 



BNSDOCID: <WO 9933982A2_L> 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
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P 
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ACCESSION 
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P 

VALUE 


72 


U67548 


Methanococcus 
jannaschii from 
bases 986219 to 
996377 (section 
90 of 150) of the 
complete 
genome. 


0.99 


YB60 YEAS 
T 


HYPOTHETICAL 16.3 
KD PROTEIN IN 
DURL2-NGR1 
INTERGENIC 
REGlON>PIR2:S4608 
4 probable membrane 
protein YBR210w - 
yeast (Saccharomyces 
cerevisiae)>GP:SCYB 
R210W1 S;cerevisiae 
chromosome II reading 
frame ORF YBR210w 


1 


73 | 


U51645 


Plasmodium 
falciparum 
cytidine 
triphosphate 
synthetase gene, 
complete cds. 


0.99 


HPSVRPL_1 


Sin Nombre virus (NM 
H 1 0) RN A L segment 
encoding RNA 
polymerase (L protein), 
complete cds; Viral 
RNA polymerase (L 
protein); 

Putative>GP:HPSVRP 
LA 1 Sin Nombre 
virus (NM Rl 1) RNA 
L segment encoding 
RNA polymerase (L 
protein), complete cds; 
Vir 


099 


74 


Z49889 


Caenorhabditis 
elegans cosmid 
T06H11, 
complete 
sequence. 


0.99 


MUSHDPRO 
B_l 


Mouse alternatively 
spliced HD protein 
mRNA, complete cds 


0.021 


75 


Z69374 


Human DNA 
sequence from 
cosmid L174G8, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3 contains a 
pair of ESTs. 


0.99 


NCPR YEAS 
T 


NADPH- 

CYTOCHROME P450 
REDUCTASE (EC 
1.6.2.4) (CPR) 


0.017 
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SEQ 
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P 

VALUE 


ACCESSION 
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P 

VALUE 


76 


Z35847 


S.cerevisiae 
chromosome II 
reading frame 
ORF YBL086c. 


0.99 


CYPA CAEE 
L 


PEPTIDYL-PROLYL 
CIS-TRANS 
ISOMERASE 10 (EC 
5.2.1.8) (PPIASE) 
(ROTAMASE) 
(CYCLOPHILIN- 
10)>GP:CELB0252_4 
Caenorhabditis eiegans 
cosmid B0252; Similar 
to peptidyl-prolyl cis- 
trans isomerase 
(PPIASE) 

(CYCLOPHILIN)>GP: 
CEU34954J 
Caenorhabditis el 


0.0044 


77 


L35330 


Rattus norvegicus 
glutathione S- 

tranc"fV»rc»c£» VK1 

II dlloICI CDC I Uj 

subunit gene, 
complete cds. 


0.99 


CELR 148 J 


Caenorhabditis eiegans 
cosmid R148; Contains 
similarity to drosophila 
DNA-binding protein 
K10(NID:g8148); 
coded for by C; eiegans 

r»"TVM A i/h- 1 1 1 1 .f . 

cuina yKi loei 

coded for by C; eiegans 

cDNA 


0.0032 


78 


Y00324 


Chicken 

vitellogenin penp 
3' flanking 
region. 


0.99 


A56922 


transcription factor shn 
- fruit fly (Drosophila 
melanogaster) 


0.0023 


79 


M32659 


D.melanogaster 
Shabl 1 protein 
mRNA, complete 
cds. 


0.99 


OMU25146J 


Oncorhynchus mykiss 
recombination 
activating protein 2 
gene, partial cds 


0.0017 


80 


Z69880 


H.sapiens 
SERCA3 gene 
(partial). 


0.99 


M84D DRO 
ME 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD>PIR2:S257 
75 testis-specific 
protein Mst84Dd - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
MST84D_4 
D;melanogaster 
Mst84Da, Mst84Db, 
Mst84Dc and Mst84Dd 
genes for put; sperm 
protein j 


0.0011 
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Nearest 
Neighbor 
(BlastN vs. 
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Proteins) 


SEC 
ID 




ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


O 1 


M99166 


Escherichia coli 
Tip repressor 
binding protein 
(wrbA) gene, 
complete cds. 


0.99 


MTU88962J 


Mycobacterium 
tuberculosis unknown 
protein gene, partial cds 


6.50E-07 




X99257 


R.norvegicus 
mRNA for lamin 
C2. 


0.99 


M1U68729J 


Meloidogyne incognita 
cuticle preprocollagen 
(col-2) mRNA, 
complete cds; Putative 


1.60E-09 


83 


AC002432 


Human BAC 
clone RG317G18 
from 7q3 1 , 
complete 
sequence. 


0.98 


1FMDC 


Foot and mouth disease 
virus type c-s8cl, chain 
C - foot and mouth 
disease virus type c- 
s8cl expressed in 
hamster kidney cells 


0.14 


84 


Z34799 


Caenorhabditis 
elegans cosmid 
F34D10, 
complete 
sequence. 


0.98 


MMU57368J 


Mus musculus EGF 
repeat transmembrane 
protein mRNA, 
complete cds; Notch 
like repeats; notch 2 


0.0028 


85 


B 15207 


344E15.TV 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
344E15. 


0.98 


POLG HCVJ 
6 


GENOME 
POLYPROTEIN 
(CONTAINS: CAPSID 
PROTEIN C (CORE 
PROTEIN); MATRIX 
PROTEIN 
(ENVELOPE 
PROTEIN M); MAJOR 
ENVELOPE 
PROTEIN E; 
NONSTRUCTURAL 
PROTEINS NS1, NS2, 
NS4A AND NS4B; 
HELICASE(NS3); 
RNA-DIRECTED 
RNA POLYMERASE 
(EC 2.7.7.48) 
(NS5))>PI 


0.00083 
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SEQ 
ID 
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DESCRIPTION 


P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


86 


AC002412 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 


0.98 


KDG1 ARAT 
H 


DIACYLGLYCEROL 
KINASE 1 (EC 
2.7.1.107) 
(DIGLYCERIDE 
KINASE) (DGK 1) 
(DAG KINASE 
1)>PIR2:S71467 
diacylglycerol kinase 
(EC 2.7.1.107) 
ATDGK1 - 
Arabidopsis 
thaliana>GP:ATHATD 
GKl l Arabidopsis 
thaliana mRNA for 
diacylglycerol kinase, 
complete c 


0.00024 


87 


X57010 


Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exons E2-E15. 


0.98 


D80O05J 


Human mRNA for 
KIAA0183 gene, 
partial cds 


5.90E-10 


88 


M83093 


Neurospora 
crassa cAMP- 
dependent protein 
kinase (cot- 1 ) 
gene, complete 
cds. 


0.98 


YA53 SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid cl3Al 1; 
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


3.00E-22 


89 


U96271 


Helicobacter 
pylori heat shock 
protein 70 
(hsp70) gene, 
cuiripicie cos. 


0.97 


SLMEN6_1 


S;Iatifolia mRNA for 
Men-6 

protein>GP:SLMEN6 
1 S;latifolia mRNA for 
Men-6 protein 


0.43 


90 


U49944 


Caenorhabditis 
elegans cosmid 
C39E6. 


0.97 


RON HUMA 
N 


MACROPHAGE 
STIMULATING 
PROTEIN RECEPTOR 
PRECURSOR (EC 
2.7.1.1 12)>PIR2:I3818 
5 protein-tyrosine 
kinase (EC 2.7.1.112), 
receptor type ron - 
human>GP:HSRONJ 
H;sapiens RON mRNA 
for tyrosine kinase; 
Putative 


0.034 
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Nearest 
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Non- 
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Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


91 


Y09255 


B.cereus dnal 
gene, partial. 


0.97 


CELT05C1_5 


Caenorhabditis elegans 
cosmid T05C1; Coded 
for by C; elegans 
cDNA ykJUio,J, coaea 
for by C; elegans 
cDNA yk34f!0;3 


0.00043 


92 


AC002413 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 


0.96 


CELC44E4_5 


Caenorhabditis elegans 
cosmid C44E4; Weak 
similarity to the 
drosophila hyperplastic 
disc protein 
(GB:L 14644); coded 
for by C; elegans 
cDNA yk49h6;5; coded 
for by C; elegans 
cDNA 


1 


93 


U41625 


Caenorhabditis 
elegans cosmid 
K03A1. 


0.96 


HMGC HUM 
AN 


HIGH MOBILITY 
GROUP PROTEIN 
HMGI- 

C>PIR2:JC2232 high 
mobility group I-C 
phosphoprotein - 
human>GP:HSHMGIC 
G5_l Human high- 
mobility group 
phosphoprotein isoform 
I-C (HMGIC) gene, 
exon 

5>GP:HSHMGICP_1 
H;sapiens mRNA for 
HMGI-C 

protein>GP:HSHMGIC 


1 


94 


Z82202 


Human DNA 
secj ucnce 
SEQUENCING 
IN PROGRESS 
*** from clone 
34P24; HTGS 
phase 1 . 


0.96 


YTH3_CAEE 
i 


HYPOTHETICAL 75.5 
KD PROTEIN 
C14A4.3 IN 
CHROMOSOME 
II>GP:CEC14A4_3 
Caenorhabditis elegans 
cosmid C14A4, 
complete sequence; 
C14A4;3; Weak 
similarity with a B; 
Flavum translocation 
protein (Swiss Prot 
accession number 
P38376) 


0.73 
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Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
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SEQ 
ID 
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DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 




AL008734 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
324M8; HTGS 
phase 1. 


0.96 


S25299 


extensin precursor 
(clone Tom L-4) - 
tomato>GP:TOMEXT 
ENB_1 L;esculentum 
extensin (class II) gene, 
complete cds 


0.0004 


yo 


L15388 


Human G 
protein-coupled 
receptor kinase 
(GRK5) mRNA, 
complete cds. 


0.96 


HUMCOL7A1 
XJ 


Homo sapiens (clones: 
CW52-2, CW27-6, 
CW15-2,CW26-5, 11- 
67) collagen type VII 
intergenic region and 
(COL7Al)gene, 
complete cds 


4.60E-06 


97 


X97384 


A.thaliana atran3 
gene. 


0.95 


<NONE> 


<NONE> 


<NONE> 


98 


M62505 


Human C5a 
anaphylatoxin 
receptor mRNA, 
complete cds. 


0.95 


RIPBBRYDI 


RIBOSOME- 
INACTIVATING 
PROTEIN BRYODIN 
(RRNA N- 

GLYCOSIDASE)(EC 
3.2.2.22) 

(FRAGMENT)>PIR2: 
SI 6491 rRNA N- 
glycosidase (EC 
3.2.2.22) bryodin - red 
bryony (fragment) 


0.83 


99 


D28778 


Cucumber mosaic 
virus RNA 1 for 
la, complete 
sequence. 


0.95 


POLS RUBV 
M 


STRUCTURAL 
POLYPROTEIN 
(CONTAINS: 
NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 
GLYCOPROTEINS El 
AND | 
E2)>PIR1:GNWVR3 
structural polyprotein - 
rubella virus (strain 
M33)>GP:TORUB24S 
_1 Rubella virus 24S 
subgenomic mRNA for 
structural proteins El, 
E2 and C; 


0.00037 


100 


AF016202 


Homo sapiens 
immunoglobulin 
heavy chain 
CDR3 gene, 


0.93 


HSU79716J 


Human reelin (RELN) 
mRNA, complete cds 


1 ; 
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SEQ 
ID 


ACCESSION 
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P 

VALUE 


ACCESSION 


DESCRIFIION 


P 

VALUE 






partial cds. 










101 


Z68303 


Caenorhabditis 
elegans cosmid 
ZK809, complete 
sequence. 


0.93 J 


HS5HT4SAR 
1 


H;sapiens mRNA for 
serotonin 4SA receptor 
(5-HT4SA-R) 


0.87 


102 


X03049 


E. coli DNA 
sequene 5' to 
origin of ; 
replication oriC. 


0.93 


S37594 


mucin - human 
(fragment) 


0.0019 


103 


M32659 


D.melanogaster 
Shabl 1 protein 
mRNA, complete 
cds. 


0.93 


S38480 


nonstructural protein - 
rubella 

virus>GP:RVM33NP_ 
1 Rubella virus M33 
RNA for a 

nonstructural protein; 
Nonstructural protein 
genes 


2.30E-06 


104 


D88687 


Human mRNA 

forKM-102- 

derived 

reductase-like 

factor, complete 

cds. 


0.93 


BAT3 HUMA 
N 


LARGE PROLINE- 
R1CH PROTEIN BAT3 
(HLA-B- 
ASSOCIATED 
TRANSCRIPT 
3)>P1R2:A35098 MHC 
class III 

histocompatibility 
antigen HLA-B- 
associated transcript 3 - 
human>GP:HUMBAT 
3A_1 Human HLA-B- 
associated transcript 3 
(BAT3) mRNA, 
complete 

cds>GP:HUMBAT3 


8.70E-07 


105 


D 16847 


Mouse mRNA for 
stromal cell 
derived protein- 1, 
complete cds. 


0.93 


S52796 


prpL2 protein - human 
(fragment)>GP:HSPRP 
L2_l H;sapiens mRNA 
for PRPL-2 protein 


3.20E-08 
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ID 

106 



ACCESSION 



D90915 



DESCRIPTION 



Synechocystis sp 

PCC6803 

complete 

genome, 17/27, 

2137259- 

2267259. 



P 

VALUE 



ACCESSION 



DESCRIPTION 



0.92 



YEK9_YEAS 
T 



HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37 20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clone 5898 



P 

VALUE 



5.90E-05 



107 



AJ001101 



Mus musculus 
mRNA for 
gClqBP gene. 



0.92 



DMU58282 1 



108 



X57108 



Human gene for 
cerebroside 
sulfate activator 
protein, exons 10- 
14. 



0.92 



S69032 



Drosophila 
melanogaster Bowel 
(bowl) mRNA, 
complete cds; 
Transcription factor; 
C2H2 zinc finger 
protein; zinc fingers 
have extensive 
sequence similarity to 
Drosophila odd- 
skipped 



3.50E-05 



hypothetical protein 
YPR144c- yeast 
(Saccharomyces 
cerevisiae)>GP:YSCP9 
65 9 1 7 Saccharomyces 
cerevisiae chromosome 
XVI cosmid 9659; 
Yprl44cp; Weak 
similarity near C- 
terminus to RNA 
Polymerase beta 
subunit (Swiss Prot; 
accession number 
PI 1213) 



4.30E-21 
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P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


109 


D 14635 


Caenorhabditis 
elegans DNA for 
EMB-5. 


0.91 


YM13 YEAS 
T 


PUTATIVE A TP- 
DEPENDENT RNA 
HELICASE 
YMR128W>PIR2:S53 
058 probable 
membrane protein 
YMR128w- yeast 
(Saccharomyces 
cerevisiae)>GP:SC955 
j *t o,ccrcvioiac 
chromosome XIII 
cosmid 9553; 
Unknown; YM9553;04, 
probable ATP- 

ucuciiuciii rvi^i^A. 

helicase, len: 


0.69 


110 


B55500 


CIT-HSP- 
387J2.TFB CIT- 
HSP Homo 
sapiens genomic 
clone 387J2. 


0.91 


U97553_79 


Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Unknown 


0.00016 


111 


X03049 


E. coli DNA 
sequene 5 f to 
origin of 
replication oriC. 


0.9 


POL_MLVAV 


POL POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(VC 111 4QV 
RIBONUCLEASE H 
(EC 

3.1.26.4))>PIR1:GNM 
VG V pol polyprotein - 
AKV murine leukemia 
virus 


0.0019 


112 


U91327 


Human 
chromosome 
12pl5 BAC clone 

V_l I 70 / OJ\"77L/0 

complete 
sequence. 


0.89 


JC5568 


serine protease (EC 
3.4.-.-) hi - Serratia 
marcescens 


1 


113 


XI 3295 


Rat mRMA for 
alpha-2u 
globulin-related 
protein. 


0.89 


MNGPOLYJ 


Mengo virus 
polyprotein genome, 
complete cds withe 
repeats 


1 


114 


Z78415 


Caenorhabditis 
elegans cosmid 
C17G1, complete 
sequence. 


0.89 


AB00012IJ 


Mouse mRNA for 
TBPIP, complete cds; 
TBP1 interacting 
protein 


0.39 



183 



9933982A2 I > 



WO 99/33982 PCT/US98/27610 

Table 2 





Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 
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P 
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ACCESSION 


DESCRIPTION 


P 1 
VALUE 


115 


AC002308 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 
22qll BAC 
Clone 1000e4; 
HTGS phase 1, 
26 unordered 
pieces. 


0.88 


YLK2 CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D 1044.2 IN 
CHROMOSOME 
III>GP:CELD1044_4 
Caenorhabditis elegans 
cosmid D1044 


0.0037 


116 


AC002073 


Human PAC 
clone DJ515N1 
from 22qll.2- 
q22, complete 
sequence. 


0.88 


S28499 


probable finger protein 
-rat>GP:RNZFP_l 
R;norvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31 


117 


Z83848 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1. 


0.87 


NDL DROM 

E 


SERINE PROTEASE 
NUDEL PRECURSOR 
(EC 3.4.21.- 
)>PIR2:A57096 nudel 
protein precursor - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
U29153_l Drosophila 
melanogaster nudel 
(ndl) mRNA, complete 
cds; Serine protease; 
Soma dependent gene 
required matern 


1 


118 


U23449 


Caenorhabditis 
elegans cosmid 

iy- r\/z a i 
KVOA 1 . 


0.87 


AF023268_3 


Homo sapiens clk2 
kinase (CLK2), 
propinl, cotel, 
glucocerebrosidase 
(GBA), and metaxin 
genes, complete cds; 
metaxin pseudogene 
and glucocerebrosidase 
pseudogene; and 
thrombospondin3 
(THBS3) 


0.21 


119 


Z68181 


H.vulgaris 
mRNA for 
elongation factor 
EF1 -alpha. 


0.87 


RABCY450C 
_1 


Rabbit cytochrome P- 
450 gene, clone pP- 
450PBc3, y end 


0.14 
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P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


120 


ACUUUUJj 


riomo Sapiens 
chromosome 9, 
complete 
sequence. 


U.o / 


VWF PANF 

V VV 1 1 N 1 

A 


VON WILLEBRAND 
FACTOR 

PRECURSOR>GP:DO 
GVWG_1 Canis 
familiaris von 
Willebrand factor 
mRNA, complete cds 


0.036 f 


121 


U23449 


Caenorhabditis 
elegans cosmid 


0.86 


S48988J 


CRP- 1 =cystatin-related 
protein [rats, Wistar 
alhinn mRNA Partial 
213 nt]; Cy statin- 
related protein; 
Method: conceptual 
translation supplied by 
author* This seauence 
comes from Fig; 


0.64 


122 


Z89651 


F.rubripes GSS 
sequence, clone 
090I24cD5. 


0.86 


CPU65981_1 


Cryptosporidium 
parvum P-ATPase gene 
(CppA-El)gene, 
complete cds; Putative 
calcium-ATPase 


0.6 


123 


Z94055 


Human DNA 
sequence from 
PAC 24M15on 
chromosome 1. 
Contains 
tenascin-R 
(restrictin), EST. 


0.86 


GLTB SYNY 
3 


FERREDOXIN- 
DEPENDENT 
GLUTAMATE 
SYNTHASE 1 (EC 
1.4.7.1) (FD- 
GOG AT)>P1 R2 : S6022 
8 glutamate synthase 
(ferredoxin) (EC 
i 47 n pitB - 
Synechocystis sp. (PCC 
6803)>GP:D90902_66 
Synechocystis sp; 
PCC6803 complete 
genome, 4/27, 402290- 
524345; Gluta 


0.03 


124 


Z49250 


Human DNA 
sequence from 
cosmid HW2, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3. 


0.86 


TRSCAPSID 
1 


Tobacco ringspot virus 
capsid protein gene, 
complete cds 


3.00E-06 
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SEQ 
ID 
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DESCRIPTION 


P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


125 


Z92855 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y48C3;HTGS 
phase 1. 


0.84 


AE000809_8 


Methanobacterium 
thermoautotrophicum 
from bases 161632 to 
172569 (section 15 of 
148) of the complete 
genome; Aspartyl-. 
tRNA synthetase; 
Function Code:10;07- 
Metabolism of 


1 


126 


AC002340 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana TAMU' 
BAC T11J7' 
genomic 
sequence near 
marker , m283'; 
HTGS phase 1,2 
unordered pieces. 


0.83 


CET01E8_3 


Caenorhabditis elegans 
cosmid T01E8, 
complete sequence; 
T01E8;3; Similar to 1- 
phosphatidylinositol- 
4,5-bisphosphate 
phosphodiesterase; 
cDNA EST CEESG02F 
comes from this gene; 


0.86 




AL008716 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
zUoCV; H 1 OS 
phase 1. 


0.83 


HIVU51189_5 


HIV-1 clone 93th253 
from Thailand, \ 
complete genome; Tat 
protein 


0.86 


128 


AC002340 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'TAMU 1 
BAC Til J7* 
genomic 
sequence near 
marker , m283 t ; 
HTGS phase 1,2 
unordered pieces. 


0.83 


S60257 


meltrin alpha - 
mouse>GP:MUSMAB 
_1 Mouse mRNA for 
meltrin alpha, complete 
cds 


0.0013 



186 



BNSDOCID: <WO 9933982A2_I_> 




PCT/US98/27610 



Table 2 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
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DESCRIPTION 
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P 
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129 


Z83848 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1 . 


0.82 


AROl PNEC 
A 


PENTAFUNCTIONAL 
AROM 

POLYPEPTIDE 
(CONTAINS: 3- 
DEHYDROQUINATE 
SYNTHASE (EC 
4.6.1.3), 3- 

DEHYDROQUINATE 
DEHYDRATASE (EC 
4.2.1.10) (3- 
DEHYDROQUINASE) 
, SHIKIMATE 5- 
DEHYDROGENASE 
(EC 1.1.1.25), 
SHIKIMATE KINASE 
(EC 2.7. 1 .7 1 ), AND 
EPSP SYNTHASE (E 


0.0098 i 


130 


AF029308 


Homo sapiens 
chromosome 9 
duplication of the 
T cell receptor 
beta locus and 
trypsinogen gene 
families. 


0.8 


CELZK84_5 


Caenorhabditis elegans 
cosmid ZK84; Final 
exon in repeat region; 
similar to long tandem 
repeat region of 
sialidase 

(SP:TCNA_TRYCR, 
rZizD 3 ) ana 
neurofilament H 
protein; coded for by C; 
elegans 


2.00E-08 


131 


AC002458 


Human BAC 
clone RG098M04 
from 7q21-q22, 

rT*mr\1f*tf* 

sequence. 


0.78 


IGF2_PIG 


INSULIN-LIKE 
GROWTH FACTOR II 
PRECURSOR (IGF- 
II)>GP:SSIGF2_l 
S;scrofa mRNA IGF2 
for insulin-like-growth 
factor 2; Insulin- like- 
growth factor 2 
preproprotein 


0.44 


132 


Z83843 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
368A4; HTGS 
phase 1. 


0.78 


PAR51AJ 


P;tetraurelia 51 A 
surface protein gene, 
complete cds 


0.0014 



187 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 PCT/US98/27610 

Table 2 





(Nearest 
1 Neighbor 
|(BlastN vs. 
1 Genbank) 






Nearest 

(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 
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P 
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P 

VALUE 


133 


1 X03021 


ouuiaii gene ior 
granulocyte- 
macrophage 
colony 

stimulating factor 
(GM-CSF). 


f\ HQ 


CEr57Bl_3 


Caenorhabditis elegans 
cosmid F57B1, 
complete sequence; 
F57B1;3; Protein 
predicted using 
Genefinder; similar to 
collagen 


2.20E-05 


134 


| Z74825 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w. 


0.77 


SYLM SCHP 
O 


PUTATIVE LEUCYL- 
TRNA 

SYNTHETASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
6.1.1.4) (LEUCINE- 
TRNA 

LIGASE)>PIR2:S6248 
6 hypothetical protein 
SPAC4G8.09- fission 
yeast 

(Schizosaccharomyces 
pombe)>GP:SPAC4G8 
_9 S;pombe 
chromosome I cosmid 
c4G8; Unknown; 
SPAC 


0.96 J 


135 I 


Z74825 


S.cerevisiae 
chromosome XV 
reading irame 
ORF YOL083w. 


0.77 


RNU59809__1 


Rattus norvegicus 
mannose 6- 
phosphate/insulin-like 
growth factor II 
receptor (M6P/IGF2r) 
mRNA, complete cds; 
Also termed IGF- 
Il/Man o-P receptor, 
MPR, CI-MPR 


0.01 1 


136 I 


U80445 


Caenorhabditis 
eJegans cosmid 
C50F2. 


0.76 


S28499 


probable finger protein 
- rat>GP:RNZFP_l 
R;norvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31 


137 j! 


Z78545 


Caenorhabditis 
elegans cosmid 
M03B6, complete 
sequence. 


0.75 


RRU73586J 


Rattus norvegicus 
Fanconi anemia group 
C mRNA comnletp 
cds; Fanconi anemia 
group C protein; 
Similar to human FAC 
protein, GenBank 
Accession Numbers 
X66893 and X66894 


0.023 jj 



188 



BNSDOCID: <WO 9933982A2_I_> 




PCT/US98/27610 



Table 2 





Nearest 

M £>■ rr H Knr 
l^ieigll litll 

(BlastN vs. 
Genbank) 






Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 
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P 
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138 




u lim o n PiKI A 
M Uillcxll JL-'lN/A 

sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
466N1; HTGS 
phase 1. 


0 74 


HSMSHRFC 
A_l 


H*saoiens mRNA for 

1 J ^OUL/IVt|iJ 1111X1 ^/ V AWl 

MSH receptor; Author- 
given protein sequence 
is in conflict with the 
conceptual translation 


0.036 


139 


ArUU/zoy 


Araoiaopsis 
thaliana BAC 
1G002N01. 


n 71 


HSU9S090 1 


Homo ^flnierK 
chromosome 1 9 cosmid 
F 19541, complete 
sequence; F19541 1; 
Hypothetical (partial) 
nrotein similar to 
proline oxidase 


0.16 


140 


AC002393 


Mouse 
BAC284H12 
Chromosome 6, 
complete 
sequence. 


0.7 


RNLTBP2_1 


Rattus norvegicus 
mRNA for LTBP-2 like 
protein; Latent TGF- 
beta binding protein-2 
like protein 


4.40E-05 


141 


B 15232 


344G8.TV 
CIT978SKA1 

il UII1U bapiCJlD 

genomic clone A- 
344G08. 


0.67 


DMSEVL2_2 


Drosophila 

melanogaster sevenless 
mRNA; Put; sevenless 
protein (A A 1-2510) 


0.41 


142 


D 13748 


Human mRNA 
for eukaryotic 
initiation factor 
4AI. 


0.66 


MMU53563_1 


Mus musculus Brgl 
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00016 


143 


S45791 


band 3-related 
protein=renal 
anion exchanger 
AE2 homolog 
[rabbits, New 
Zealand White, 
ileal epithelial 
cells, mRNA, 
3964 nt]. 


0.66 


POLS RUBV 
R 


STRUCTURAL 
POLYPROTEIN 
(CONTAINS: 
NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 
GLYCOPROTEINS El 
AND 

E2)>PIR1:GNWVRA 
structural polyprotein - 
rubella virus f strain 
RA27/3 

vaccine)>GP:RUBCE2 
1 1 Rubella virus 
RA27/3 RNA for 
capsid, E2 and El 
proteins; Poly 


5.60E-05 


144 


M22462 


Chicken protein 
p54 (ets-1) 


0.66 


HSHP8PROT 
_1 


H;sapiens mRNA for 
HP8 protein; HP8 


2.00E-06 
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ACCESSION 


DESCRIPTION 


P 
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mRNA, complete 
cds. 






peptide 




145 


| U27999 


Human clone 
pDEL52All 
HLA-C region 
cosmid 52 
genomic survey 
sequence. 


0.65 


CA18 HUMA 

N 


COLLAGEN ALPHA 
l(VIII) CHAIN 
PRECURSOR 
(ENDOTHELIAL 
COLLAGEN)>PIR2:S 
15435 collagen alpha 
l(VIII) chain precursor 

human>GP:HSCOL8A 
1 J HumanCOL8Al 
mRNA for alpha 
1(VI1I) collagen 


5.70E-06 


1 Af% 
1 *+o 


M54787 


N.crassa mating 
type a-1 protein 
(mt a-1) gene, 
exons 1-3. 


0.64 


150717 


vacuolar H+-ATPase A 
subunit - chicken 
(fragment)>GP:GGU22 
078_1 Gallus gallus 
vacuolar H+-ATPase A 
subunit gene, partial 
cds 


0.0046 | 


147 


AC002094 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


0.63 


PVPVA1J 


P;vivax pval gene 


0.1 


148 


U32701 


Haemophilus 
influenzae from 
bases 165345 to 
176101 (section 
16 of 163) of the 
complete 
genome. 


0.63 


FABG HAEI 
N 


3-OXOACYL-[ACYL- 
CARRIER PROTEIN] 
REDUCTASE (EC 
1.1.1.100) (3- 
KETOACYL- ACYL 
CARRIER PROTEIN 
REDUCTASE)>PIR2: 
D64051 3-oxoacyl- 
[acyl-carrier-protein] 
reductase (EC 
1.1.1.100)- 
Haemophilus 
influenzae (strain Rd 
KW20)>GP:HIU32703 
7 Haemophilus 


2.00E-12 


149 


Z37159 


T.brucei serum 
resistance 
associated (SRA) 
mRNA for VSG- 
[ike protein. 


0.61 


<NONE> 


<NONE> 


<NONE> 
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P 
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150 


AF027865 


Mus musculus 
Major 

Histocompatibilit 
y Locus class 11 
region. 


0.61 


A56514 


chromokinesin - 
chicken>GP:GGU1830 
9__1 Gallus gallus 
chromokinesin mRNA, 
complete cds 


0.045 


151 


U40938 


Caenorhabditis 
elegans cosmid 
D1009. 


0.61 


YA53 SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03 IN 

n t\\j l vi vj o \J i vi c 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid c!3Al 1; 

SPAC13A11;03, 
unknown, len: 210 


1 .90E-24 


152 


116670 


Sequence 1 from 
patent US 
5476781. 


0.59 


CELF21F8_7 


Caenorhabditis elegans 
cosmid F21F8; Similar 
to eukaryotic aspartyl 
proteases 


0.39 


153 


Z84468 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
299D3; HTGS 
phase 1. 


0.59 


CLG1 YEAS 
T 


CYCLIN-LIKE 
PROTEIN 

CLGI>P1R2:S37607 
cyclin-like protein 
YGL215w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCYG 
L215W_1 S;cerevisiae 

r* Vi»t\tti rtcnmp \/ IT 
CHI vJiii UaLJl lie Vli 

reading frame ORF 
YGL215w>GP:YSCC 
LG1CPR 1 
Saccharomyces 
cerevisiae cyclin-like 
protein (CLG1) gene 


0.0015 


154 


U00054 


Caenorhabditis 
elegans cosmid 
K07E12. 


0.57 


<NONE> 


<NONE> 


<NONE> 


155 


M21207 


Synthetic SV40 T 
antigen mutant 
pseudogene, 3' 
end. 


0.57 


1CJL2 


cathepsin L (EC 
3.4.22.15) mutant 
(F(78P)L, C25S, 
T110A,E176G, 
D178G), fragment 2 - 
human 


0.43 
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P 
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156 


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds. 


0.56 


AC002125_4 


Homo sapiens DNA 
from chromosome 1 9- 
cosmid F25965, 
genomic sequence, 
complete sequence; 
F25965_5; 

Hypothetical 35;3 kDa 
protein similar to 
GTPase-activating 
proteins and orf3 from 


0.6 


157 


M86352 


Stigmatella 
aurantiaca reverse 
transcriptase (163 
RT) gene, 
complete cds. 


0.56 


AC002398_4 


Human DNA from 
chromosome 19- 
speciflc cosmid 
F25965, genomic 
sequence, complete 
sequence; F25965_3; 
Hypothetical 96 kDa 
human protein similar 
to alpha chimaerin; 
Hypothetical 
protein>GP:AC002398 
_4 Human DNA from 
chromosome 19- 
specific cosmi 


4.50E-06 


158 


AC003101 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HRPC41C23; 
H 1 OS phase 1, 
33 unordered 
pieces. 


0.54 


<NONE> 


<NONE> 


<NONE> 


159 


B12117 


F5L15-T7 IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 


0.54 


CEF32H2_5 


Caenorhabditis elegans 
cosmid F32H2, 
complete sequence; 
F32H2;5; Similarity to 
Chicken fatty acid 
synthase (SW:P 12276); 
cDNA EST ykl6c2;5 
comes from this gene; 
cDNA EST ykl J3h6;5 
comes 1 


1 
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Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


lot) 


AnUUUoo4 


IV1US ITIU5>CU1U1> 

TCR beta locus 
from bases 
250554 to 501917 
^i>cciioii x. ui -> j ui 
the complete 
sequence. 




CFT01G9 6 


Caenorhabditis elegans 
cosmid T01G9, 
complete sequence; 
T01G9;4; CDNA EST 
yk29b7;5 comes from 
this gene 


0.84 


161 


B12117 


F5L15-T7 IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 


0.54 


A39718 


nicotinic acetylcholine 
receptor alpha chain - 
marbled electric ray 
(fragments) 


0.27 


162 


Z71261 


Caenorhabditis 
elegans cosmid 
F21C3, complete 
sequence. 


0.5 


KDGE DRO 
ME 


EYE-SPECIFIC 
DIACYLGLYCEROL 
KINASE (EC 
2.7.1.107) (RETINAL 
DEGENERATION A 
PROTFIN^ 
(DIGLYCERIDE 
KINASE) 

(DGK)>GP:DRODAG 
K J Fruit fly mRNA 
for diacylglycerol 
kinase, complete cds 


4.60E-05 


163 


M61831 


Human S- 
adenosylhomocys 
teine hydrolase 
(AHCY) mRNA, 
complete cds. 


0.49 


P2C2 A RAT 
H 


PROTEIN 

PHOSPHATASE 2C 
(EC 3.1.3.16) 
(PP2C)>PIR2:S55457 
phosphoprotein 
nhosnhatase TEC 
3.1.3.16) 2C- 
Arabidopsis 
thaliana>GP:ATHPP2 
CA_1 Arabidopsis 
thaliana mRNA for 
protein phosphatase 2C 


5.60E-08 


164 


U42608 


Glycine max 
clathrin heavy 
chain mRNA, 
complete cds. 


0.48 


<NONE> 


<NONE> 


<NONE> 
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Gen bank) 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


165 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase 1 . 


0.47 


PYRD BACS 
U 


DIHYDROOROTATE 

DEHYDROGENASE 

(EC 1.3.3.1) 

(DI H YDROOROTA TE 

OXIDASE) 

(DHODEH ASE)>PI R 1 

:H39845 

dihydroorotate oxidase 
(EC 1.3.3.1) -Bacillus 
subtilis>GPN:BSUB00 
09_25 Bacillus subtilis 
complete genome 
(section 9 of 2 1 ): from 
1598421 to 1807200; 


0.002 


166 


AC000044 


Human 
Chromosome 
22ql3 Cosmid 
Clone p76el0, 
complete 
sequence. 


0.47 


MATK MAR 
PO 


PROBABLE INTRON 
MATURASE>PIR2:A 
05034 hypothetical 
protein 370i - liverwort 
(Marchantia 
polymorpha) 
chloropiast>GP:CHMP 
XX_21 Liverwort 
Marchantia 

polymorpha chloroplast 
genome DNA; 
ORF370i 


0.0011 


167 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial). 


0.47 


S45361 


LRJ147 protein - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
LRR47J 

D;melanogaster mRNA 
for LRR47 


5.30E-07 


168 


Z67035 


H.sapiens DNA 
segment 
containing (CA) 
repeat; clone 
AFM323yf 1 ; 
single read. 


0.45 


JQ2246 


22. 5K cathepsin D 
inhibitor protein 
precursor - 

potato>GP:POTCATH 
D_l Potato cathepsin D 
inhibitor protein 
mRNA, complete cds 


0.79 


1 £Q 

1 D7 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone ! 
6B17; HTGS 
phase 1. 


0.44 


SMU31768_1 


Schistosoma mansoni 
elastase gene, 3045 bp 
clone, complete cds 


0.0022 
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Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCR1F1ION 


P 

VALUE 


170 


LI 1 172 


Plasmodium 
falciparum RNA 
polymerase I 
gene, complete 
cds. 


0.43 


HUMPKD1G0 
8 J 


Homo sapiens 
polycystic kidney 
disease (PKD1) gene, 
exons 43-46; Polycystic 
kidney disease 1 
protein 


1 


171 


Z95889 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
211A9; HTGS 
phase 1 . 


0.43 


A09811J 


R;norvegicus mRNA 
for BRL-3A binding 
protein; Author-given 
protein sequence is in 
conflict with the 
conceptual translation 


U.UUUoJ 


172 


U32772 


Haemophilus 
influenzae from 
bases 9548 19 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.43 


YPT2 CAEE 
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
II1>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4 8 Caenorhabditis 
elegans cosmid F37A4 


2.50E-28 


173 


Z99281 


Caenorhabditis 
elegans cosmid 
Y57G11C, 
complete 
sequence. 


0.42 


PTU19464_1 


Paramecium tetraurelia 
outer arm dynein beta 
heavy chain gene, 
complete cds 


1 


174 


X04571 


Human mRNA 
for kidney 
epidermal growth 
factor (EGF) 
precursor. 


0.42 


YEK9YEAS 
T 


HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
1NTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w - yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37__20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clone 5898 


0.99 
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Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


175 


U32772 


Haemophilus 
influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.41 


YPT2 CAEE 
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A4 


7.80E-21 


176 


AC002053 


Human 
Chromosome 
9p22 Cosmid 
Clone 92f5, 
complete 
sequence. 


0.4 


HSU33837_1 


Human glycoprotein 
receptor gp330 
precursor, mRNA, 
complete cds 


I 


177 


U88309 


Caenorhabditis 
elegans cosmid 
T23B3. 


0.4 


DROMTTGN 
C_l 


Drosophila 
meianogaster 
mitochondrial 
cytochrome c oxidase 
subunit I (COI) gene, 5' 
end, Trp-, Cys-, and 
Tyr-tRNA genes, 
NADH dehydrogenase 
subunit 2 (ND2) gene, 
3' end 


0.99 


178 


M34025 


Human fetal Ig 
heavy chain 
variable region 
(clone M44) 
mRNA, partial 
cds. 


0.39 


DNA2 YEAS 
T 


DNA REPLICATION 
HELICASE 
DNA2>PIR2:S48904 
probable purine 
nucleotide-binding 
protein YHR164c- 
yeasi ^oaccnaromyces 
cerevisiae)>GPN:YSC 
H9986_3 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9986; 
Dna2p: DNA 
replication helicase; 
YHR1640GP: 


1 


| 179 


AC002395 


Homo sapiens; 
HTGS phase 1, 
127 unordered 
pieces. 


0.39 


VVJvlUMPE 


NONSTRUCTURAL 
PROTEIN V 
(NONSTRUCTURAL 
PROTEIN NS1) 


0.11 I 
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Neighbor 
(BlastX vs. 
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Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


180 


AC003101 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 

chromosome 17, 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 
pieces. 


0.39 


YLK2 CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D 1044.2 IN 
CHROMOSOME 
II1>GP:CELD1044_4 
Caenorhabditis elegans 
cosmid D1044 


0.0001 


181 


Z54335 


Human DNA 
sequence from 

rncm iH i 1 7 AO 

Huntington's 
Disease Region, 
chromosome 
4pl6.3. Contains 
VNTR and a CpG 
island. 


0.39 ; 


HUMNFAT3 
A J 


Homo sapiens NF-AT3 
mRNA, complete cds 


1.60E-06 


182 


U95743 


Homo sapiens 
cnrornosuiiic 10 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


0.38 


CEZC434J5 


Caenorhabditis elegans 
cosmid ZC434, 
complete sequence; 
ZC434;6; CDNA EST 
CEESO02F comes 
from this gene; cDNA 
EST CEESS60F comes 
from this gene 


0.18 


1 CI 




F5I14from 

Arabidopsis 

thaliana 

chromosome 1, 

complete 

sequence. 


0.34 


HSOCAM 1 


H;sapiens mRNA for 
immunoglobulin-like 
domain-containing 1 
protein 


0.051 


184 


X01703 


Human gene for 
alpha-tubulin (b 
alpha 1 ). 


0.33 


NTC3_MOUS 
E 


NEUROGENIC 
LOCUS NO I LH J 
PROTEIN>PIR2:S453 
06 notch 3 protein - 
mouse>GP:MMNOTC 
1 M;musculus mRNA 
for Notch 3 


0.012 
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Nearest 
i>cignDor 
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Non- 
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Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


IBS 


7821 8Q 


Unman rVM A 
n UIIld.Il L/1N/A 

sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
170A21; HTGS 
phase 1. 


U.J 1 


LG 1 06 3 


Lemna gibba 
negatively light- 
regulated mRNA 
(Lgl06); Second 
longest ORF (2) 


0.27 


186 


Z9805I 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
501A4;HTGS 
phase 1 . 


0.3 


S34960 


NADH dehydrogenase 
(ubiquinone) (EC 
1.6,5.3) chain 5- 
Crithidia oncopelti 
mitochondrion 
(SGC6)>GP:MICOCN 
NR 3 Crithidia 
oncopelti mitochondrial 
ND4, ND5, COI, 12S 
ribosomal RNA genes 
for NADH 

dehydrogenase subunit 
4/5, cytochrome 
oxidase subun 


0.25 


187 


Z98749 


Human DNA 
sequence *** 
SEQUENCING 

TXT DDAnDTTCC 

*** from clone 
449017;HTGS 
phase 1 . 


0.3 


SCKC LEIQ 
H 


CHARYBDOTOXIN 
(CHTX) (CHTX- 
LQ1)>PIR2:A60963 
charybdotoxin 1 - 
scorpion (Leiurus 
quinquestriatus)>3D:2 
CRD Charybdotoxin 
(nmr, 12 structures) - 
scorpion (Leiurus 
auinauestriatu^ 


0.12 


188 


X96763 


C.albicans CDC4 
gene. 


0.29 


CECC4_1 


Caenorhabditis elegans 
cosmid CC4, complete 
sequence; CC4;a; 
Protein predicted using 
Genefinder; 
preliminary prediction 


1.30E-17 
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Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P ! 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


189 


U38804 


Porphyra 

purpurea 

chloroplast 

genome, 

complete 

sequence. 


0.28 


HIVHCDR3C 
J 


Human 

immunodeficiency 
virus type 1 heavy- 
chain complemetarity- 
determining region 3 
mRNA (clone 11), 
partial cds; Heavy- 
chain complementarity- 

Hetprm ininf* region 3 

(CDR3) from HIV 
gpl20- 

>GP:HIVHCDR3I_1 
Human 

immunodeflciencv 
virus type 1 he 


1 


190 


U20657 


Human ubiquitin 
protease (Unph) 
proto-oncogene 
mRNA, complete 
cds. 


0.28 


HSU20657J 


Human ubiquitin 
protease (Unph) proto- 
oncogene mRNA, 
complete cds 


5.60E-12 


191 


AC002037 


Human 

cnromosome 1 1 

Overlapping 

Cosmids 

cSRL72g7 and 

cSRL140b8, 

complete 

sequence. 


0.27 


VRP1YEAS 

T 
i 


VERPROLIN>GP:SC 
VFRPRL 1 
S;cerevisiae (A364) 
gene for verprolin 


2.00E-11 


192 


U58748 


Caenorhabditis 
elegans cosmid 
ZK180. 


0.27 


EXLP TOBA 
C 


P1ST1L-SECIFIC 

EXTENSIN-LIKE 

PROTEIN 

PRECURSOR 

(PELP)>PIR2:JQ1696 

ni^til extensin-like 

piolll wAlv I 1 Jill lllvv 

protein precursor (clone 
pMG15) - common 
tobacco>GP:NTPMG 1 
5_1 N;tabacum mRNA 
for pistil extensin like 
protein 


4.10E-12 


193 


Z68013 


Caenorhabditis 
elegans cosmid 
W02H3, 
complete 
sequence. 


0.26 


<NONE> 


<NONE> 


<NONE> 
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(BJastX vs. 
Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


194 


AFO 17042 


Dictyostelium 
discoideum LTR- 
retrotransposon 
Skipper, partial 
genomic 
sequence, 5' end. 


0.26 


SPBC31F10 1 
4 


S;pombe chromosome 
II cosmid c31F10; 
Hypothetical protein; 
SPBC31F10;14c, 
unknown, len:1586aa, 
some similarity eg; to 
YJR140C, 
YJ9H_YEAST, 
P47171, involved in 
cell cycle regulation 


1 


1 


B03174 


cSRL-16e2-u 
cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
cSRL-16e2. 


0.26 


CELC30E1J7 


Caenorhabditis elegans 
cosmid C30E1 


0.38 




X70810 


E.gracilis 
chloroplast 
complete 
genome. 


0.25 


CEK10H10_8 


Caenorhabditis elegans 
cosmid Kl OH 10, 
complete sequence; 
K10H10;k; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.98 


197 


U80024 


Caenorhabditis 
elegans cosmid 
C18B10. 


0.25 


MMAF001794 
_1 


Mus musculus Treacher 
Collins Syndrome 
protein (Tcofl) mRNA, 
complete cds; Putative 
nucleolar 

phosphoprotein; similar 
to Homo sapiens 
Treacher Collins 
syndrome TCOF1 
protein 

encoded>GP:MMAF00 
I794_J Mus musculus 
Treacher Collins 
Syndrome p 


0.017 
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Genbank) 
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Neighbor 
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Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


198 : 


AC000591 


Drosophila 
melanogaster 
(subclone 9_g3 
from PI DS01486 
(D32)) DNA 
sequence, 
complete 
sequence. 


0.25 


YHGE ECOL 
1 


HYPOTHETICAL 64.6 
KD PROTEIN IN 

INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 
7 Escherichia coli , 
mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfl 


0.00068 














199 


AC000591 


Drosophila 
melanogaster 
(subclone 9_g3 
from PI DS01486 
(D32)) DNA 
sequence, 
complete 
sequence. 


0.25 


YHGE ECOL 
I 


HYPOTHETICAL 64.6 
KD PROTEIN IN 

INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 

7 T- ^rhprirhia coli 

/ L_^0^1 lt>l 1^1 MCI VsVJ J 1 , 

mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfl 


0.00068 














200 


Z99571 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
388N15;HTGS 
phase 1 . 


0.24 


YA53 SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A1I.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid cl3Al 1; 
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


0.017 


201 


j U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.24 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05 
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Gen bank) 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


202 


AC003061 


*** 

SEQUENCING 
IN PROGRESS 
*** Mouse 
Chromosome 6 
BAC clone 
b245cl2; HTGS 
phase 2, 8 
ordered pieces. 


0.23 


CGI HUMA 
N 


CGI 

PROTEIN>GP:HSU46 
023_1 Human Xq28 
mRNA, complete cds; 
Orf 


0.00078 


203 


AF009420 


Homo sapiens 
microsatellite 
sequence in the 
HNF3agene. 


0.22 


PN0675 


collagen alpha 
1(XVI1I) chain - mouse 
(fragment)>GP:MUSC 
OLLAG_l Mouse 
mRNA for collagen, 
partial cds 


0.00072 


204 


B18861 


F20C18-Sp6 IGF 
Arabidopsis 
thaliana genomic 
clone F20C18. 


0.22 


TFDP00659 


- Polypeptides entry 
for factor PR 


0.0003 


205 


U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.22 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05 


206 


X52105 


Dictyostelium 
discoideum SP60 
gene for spore 
coat protein. 


0.18 


<NONE> 


<NONE> 


<NONE> 


207 


L07628 


Saccharopolyspor 
a erythraea 
insertion 

sequence IS 1 136, 
copy B, 3' end. 


0.17 


D88764J 


Rana catesbeiana 
mRNA for alpha 2 type 
I collagen, complete 
cds 


0.00021 


208 


74961 1 


O.C-CI C VIbidC 

chromosome X 
reading frame 
ORF YJR13 Iw. 


U. 1 0 


1 


Saccharomyces 
cerevisiae alantoinase 
(DAL 1) gene, complete 
cds 


1 


209 


Z87893 


F.rubripes GSS 
sequence, clone 
043C17aB8. 


0.16 


CELC27A12 
8 


Caenorhabditis elegans 
cosmid C27A12; 
Partial CDS; this gene 
begins in the 
neighboring clone; 
coded for by C; elegans 
cDNA ykl27fl;3; 
coded for by C; elegans 
cDNAyk!27fl;5 


1.30E-07 



202 



BNSDOCID: <WO 9933982A2_I_> 




PCT/US98/27610 



Table 2 





Nearest 

JVT ■ rrh li f\ r 

pceignuur 
(BlastN vs. 
Genbank) 






Nearest 
Neigh bor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


p i 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


210 


U92852 


Rhoiptelea 

Cni Manilla 

maturase (matK) 
gene, chloroplast 
gene encoding 
chloroplast 
protein, complete 
cds. 


0.15 


SEU40259_5 


Staphyloccous 
epidermidis 
trimethoprim resistance 
plasmid pSK639; Orf53 


0.95 


211 


X62620 


B.mori Abd-A 
gene homeobox. 


0.15 


ATAP22_36 


Arabidopsis thaliana 
DNA chromosome 4, 
FSSA I AP2 contie 
fragment No; 2; 
Hypothetical protein; 
Similarity to NADH 
dehydrogenase, 
Chondrus crispus; 
MNOS:S59107 


0.75 


212 


J02079 


ep stein- ban* virus 
simple repeat 
array (ir3). 


U. 1 D 


r\J OJHO 


iiltrA-hioh-<;iilfiir 

HI LI a 111 till jUIIUJ 

keratin 1 - 

mouse>GP:MUSSERl 
1 Mouse serine 1 ultra 
high sulfur protein 
gene, complete cds; 
Putative 


7.50E-05 


213 


M35027 


Vaccinia virus, 

complete 

genome. 


0.14 


MTF1FUSN 
I j 


MODIFICATION 
METHYLASE FNUDI 
(EC 2.1.1.73) 
(CYTOSINE- 
SPECIFIC 

METHYLTRANSFER 
ASE FNUDI) 
(M. FNUDI) 


0.87 


214 


AC003058 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'IGF 
BAC 'F27F23' 
genomic 
sequence near 
marker 
'CIC06E08'; 
HTGS phase 1, 8 
unordered pieces. 


0.14 


HEXAJ3ICDI 


BETA- 

HEXOSAMINIDASE 
ALPHA CHAIN 
PRECURSOR (EC 
3.2.1.52) (N-ACETYL- 
BETA- 

GLUCOSAMINIDASE 
) (BETA-N- 
ACETYLHEXOSAMI 
NIDASE)>PIR2:A307 
66 beta-N- 

acety 1 hexosam i n i dase 
(EC 3.2. 1.52) A 
precursor - slime mold 
(Dictyostelium 


0.006 
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ACCESSION 
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ACCESSION 


DESCRIPTION 


P 

VALUE 












discoideum)>GP:DDIN 
AGA l D;d 




215 


AC001229 


Sequence of BAC 

F5I14from 

Arabidopsis 

thaliana 

chromosome 1, 

complete 

sequence. 


0.13 


A49281 


pol protein - simian T- 
cell lymphotropic virus 
type 1, STLV-J (isolate 
Bab34) 

(fragment)>GP:STVB 
ABPOLA_l Simian T- 
cell leukemia virus 
PCR derived (pol) 
gene, partial sequence 
BAB34POL; Bases 
4779-4918 EMBL 
ATK numbering 
system; BAB34POL 


0.77 


216 


U46067 


Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 


0.12 


S70663 


lectin heavy chain, N- 
acetylgalactosamine- 
specific - Entamoeba 
histolytica 

(fragment)>GP:EHU33 
443_1 Entamoeba 
histolytica GalNAc 
lectin heavy subunit 
(hgl4) gene, partial cds; 
N-acetylgalactosamine 
adherence lectin heavy 
subunit 


0.8 


217 


AC000380 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 3 
pac pDJ70ill; 
HTGS phase 1,2 
unordered pieces. 


0.12 


ATFCA8J9 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
Unnamed protein 
product 


0.64 
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Non- 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


218 


X61207 


A.brasilense hisB, 

H, A, F and E 

genes for 

imidazole 

glycerolphosphat 

e dehydratase, 

glutamine 

am idotransferase, 

phosphory bosi lfo 

rmimino-5- 

amino- 

phosphorybosil- 
4- 

imidazolecarboxa 
mide isomerase, 
cyclase and 
phosphory bosi 1- 
AMP- 

cyclohydrolase. 


0.12 


OCCL02J 


0;circumcincta colost- 
2 gene; Cuticular 
collagen 


0.0074 


219 


AF014259 


HIV-1 Patient 
1088 from 
Edinburgh, MA- 
pl7(gag) gene, 
partial cds. 


0.11 


DMU88570_1 


Drosophila 
melanogaster CREB- 
binding protein 
homolog mRNA, 
complete cds; CBP 


1 


220 


AC000636 


Drosophila 
melanogaster 
(subclone 2 cl 1 
from PI DS07660 
(D44)) DNA 
sequence, 
complete 
sequence. 


0.11 


A64829 


hypothetical protein in 
dmsC 3* region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000192_ 
1 Escherichia coli , 
ycaD, ycaK, pflA, pflB, 
focA genes from bases 
944908 to 955952 
(section 82 of 400) of 
the complete genome; 
Hypothetical protein in 
dmsC 


0.051 


221 


AC002428 


Human BAC 
clone GS039E22 
from 5q3 1 , 
complete 
sequence. 


0.11 


HSNMYC2J 


Human N-myc gene 
exon 2; Put; N-myc 
protein (aa 1-263) (953 
is 1 st base in codon) 


0.00014 


222 


L40949 


Homo sapiens 
(clone AT7-5eu) 
opioid-receptor- 
like protein 
mRNA, 5' end. 


0.11 


CEUNC93_2 


C;elegans unc-93 gene; 
Protein 2 


1.20E-13 
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ACClLiSMOiN 
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P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


223 


AL008636 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
722E9; HTGS 
phase 1. 


0.1 


XELCOL2A 1 

A__l 


Xenopus laevis alpha- 1 
collagen type II' 
mRNA, complete cds; 
Alpha-1 type IF 
collagen 


2.60E-06 


224 


D86993 


Human (lambda) 
DNA for 
immunoglobulin 
light chain. 


0.1 


CELM02B7_2 


Caenorhabditis elegans 
cosmid M02B7 


1.80E-09 


225 


AC002539 


Homo sapiens 
chromosome 17, 
clone 195o20, 
complete 
sequence. 


0.098 


MTCY7D1 1 
17 


Mycobacterium 
tuberculosis cosmid 
Y7D11; Unknown; 
MTCY07Dll;17c; 
unknown, len: 1 86 aa, 
FASTA best: Q10390 
Y009_MYCTU 
hypothetical 3 1 ;0 KD 
protein MTCY190;09C 
(299 aa) opt: 355 z- 
score: 316;8 


0.026 


226 


M88165 


Human inter- 
alpha-trypsin 
inhibitor light 
chain (ITI) gene, 
exon 1. 


0.096 


A54161 


ryanodine-binding 
protein alpha form - 
bul!frog>GP:D21070_l 
Rana catesbeiana 
mRNA for bullfrog 
skeletal muscle calcium 
release channel 
(ryanodine receptor) 
alpha isoform(RyRl), 
complete cds; 
Ryanodine receptor 
alpha isoform 


1 


227 


Z92851 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39G8; HTGS 
phase 1. 


0.082 


CYA7 BOVI 
N 


ADENYLATE 
CYCLASE, TYPE VII 
(EC 4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE) 


0.3 
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SEQ 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


228 


L00638 


Arabidopsis 
thaiiana ubiquitin 
conjugating 
enzyme exons 2- 
4. 


0.072 


NUCM TRY 
BB 


xt a fat t i rr"> I /~\ I TIXIAXTr 1 

NADH-UBIQUINONE 
OXIDOREDUCTASE 
49 KD SUBUNIT 
HOMOLOG (EC 
1.6.5.3) (NADH 
DEHYDROGENASE 
SUBUNIT 7 
HOMOLOG)>PIR2:A3 
5693 NADH 
dehydrogenase (EC 
1.6.99.3) chain 7- 
Trypanosoma brucei 
mitochondrion (SGC6) 




Tin 


U49169 


Dictyostelium 
discoideum V- 
ATPase A 
subunit (vatA) 
mRNA, complete 
cds. 


0.071 


MMU65594J 


Mus musculus Brca2 
mRNA, complete cds; 
Similar to human breast 
cancer susceptibility 
gene BRCA2; Allele: 
wild type; putative 
tumor suppressor 


1 


230 


AF001549 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987SK- 
270G1 complete 
sequence. 


0.07 


PM22 HUMA 
N 


PERIPHERAL 
MYELIN PROTEIN 22 
(PMP- 

22)>PIR2:JN0503 
peripheral myelin 
protein 22 - 
human>GP:HUMGAS 
3X_1 Human 
peripheral myelin 
protein jll ^vj/\z>j ) 
mRNA, complete 
cds>GP:HUMPMP22_ 
1 Human peripheral 
myelin protein 22 
mRNA, complete 
cds>GP:HUMPMP22 


0.0078 


231 


L36829 


Mus musculus 
alphaA-crystallin- 
binding protein I 
(AlphaA- 
CRYBPl)gene, 
complete cds. 


0.066 


<NONE> 


<NONE> 


<NONE> 


232 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone 1 Iql 3; 


0.058 


CEZK863_1 


Caenorhabditis elegans 
cosmid ZK863, 
complete sequence; 
ZK863;2; Similar to 
collagen 


1 
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: Nearest 
1 Neighbor 

(BlastN vs. 

Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 






h J Ob phase 1, 
10 unordered 
pieces. 










233 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone llql3; 
HTGS phase 1, 
1 0 unordered 
pieces. 


0.058 


CAC2 HAEC 
O 


CUTICLE 
COLLAGEN 2C 
(FRAGMENT)>GP:H 
AECOL2CJ 
H;contortus collagen 
2C mRNA, 3'end 


1.20E-08 


234 


Z23908 


H. sapiens 
(D5S630) DNA 
segment 
containing (CA) 
repeat; clone 
AFM268zd9; 
single read. 


0.057 


VEU34999J 


Venezuelan equine 
encephalitis virus 
nonstructural and 
structural polyprotein 
genes, complete cds; 
Nonstructural 
polyprotein; Internal 
stop codon, 

readthrough occurs 5% 
of the time 


0.0002 


235 


B21875 


T3E8-Sp6 TAMU 
Arabidopsis 
thaliana genomic 
clone T3E8. 


0.055 


YRR2 CAEE 
L 


HYPOTHETICAL 91.1 
KD PROTEIN R 144.2 
IN CHROMOSOME 
III>GP:CELR144_7 
Caenorhabditis elegans 
cosmid R144; Coded 
for by C; elegans 
cDNA CEESP84R; 
coded for by C; elegans 
cDNA yk23c4;5; coded 
for by C; elegans 
cDNA yk44f9;5; coded 
for by C; eleg 


0.68 


236 | 


Z98303 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
140H19; HTGS 
phase 1. 


0.048 


AC002330__3 


Arabidopsis thaliana 
BACT10P11, 
complete sequence; 
Putative zinc-finger 
protein; C2H2 Zn- 
finger signature from 
position 80 to 100 
[CEICNKGFQRDQNL 
QLHRRGH] 


0.99 
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Genbank) 
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SEQ 
ID 
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P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


237 


D49911 


Therm us 
thermophilus 
UvrA gene, 
complete cds. 


0.044 


APP1 MOUS 
E 


AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(AP\ PV>PIR2A46362 
amyloid precursor-like 
protein - 

mouse>GP:MUSAPLP 

1 Mouse amyloid 
nrpri ir<;or-lilcp nrotein 
mRNA, complete cds 


8.90E-06 


238 


D49911 


Therm us 
thermophilus 
UvrA gene, 
complete cds. 


0.044 


MMCOL18A1 
1_2 


Mus musculus alpha- 
1(X VIII) collagen 
(COL18Al)gene, 
exons 40-43, complete 
cds 


1 .60E-06 


239 


X78119 


P.amygdalus, 
Batsch (Texas) 
prul mRNA. 


0.042 


CA44 HUMA 
N 


COLLAGEN ALPHA 
4(1 V) CHAIN 
PRECURSOR>PIRl :C 

UrlLJ 1 O CUlldgCIl aipila 

4(1 V) chain precursor - 
human>GP:HSCOL4A 
4_1 H;sapiens mRNA 
for collagen type IV 
alpha 4 chain; Type IV 
collagen alpha 4 chain 


2.00E-06 


240 


U72877 


Rana catesbeiana 
L-epinephrine 
transporter 
mRNA, complete 
cas. 


0.041 


YRR6 MYCC 
A 


HYPOTHETICAL 33.0 
KD PROTEIN IN 
LICA 3'REGION (ORF 
R6)>PIR2:S42125 

Mycoplasma 
capricolum 
(SGC3)>GP:MYCRP 
MH 6 M; capricolum 
rpmH, rnpA and lie A 
t?ene* Orf* R6 


0.0008 


241 


L39891 


Homo sapiens 
polycystic kidney 
disease- 
associated protein 
(PKDl)gene, 
complete cds. 


0.04 


MUC2 HUM 
AN 


MUCIN 2 
(INTESTINAL 
MUCIN 2) 
(FRAGMENTS) 


5.90E-05 


242 


L40390 


Candida glabrata 
ERG3 gene, 
complete cds. 


1 0.039 


GO 1763 


atrophin-1 - 
human>GP:HSU23851 
_1 Human atrophin-1 
mRNA, complete cds 


9.00E-07 
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P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


243 


B28113 


T2L16TRB 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2L 16. 


0.038 


CELZK1248 
14 


Caenorhabditis elegans 
cosmid ZK1248 


1.60E-18 


244 


AC000030 


00175, complete 
sequence. 


0.033 


ATFCA8_40 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
GlyceroI-3-phosphate 
permease homolog; 
Similarity to glycerol- 
3-phosphate permease - 
Haemophilus 
influenzae 


0.63 


245 


B 10738 


F13G15-Sp6 IGF 
Arabidopsis 
thaliana genomic 
clone F13G15. 


0.032 


D87521J 


Mus musculus DNA- 
PKcs mRNA, complete 
cds 


0.21 


246 


AF024503 


Caenorhabditis 
elegans cosmid 
F31F4. 


0.03 


138344 


titin - human 


1 


247 


Z49888 


Caenorhabditis 
elegans cosmid 
F47A4, complete 
sequence. 


0.027 


KSU520641 


Kaposi's sarcoma- 
associated herpes-like 
virus ORF73 homolog 
gene, complete cds; 
Herpesvirus saimiri 
ORF73 

homolog>GP:KSU756 
98 78 Kaposi's < 
sarcoma-associated 
herpesvirus long unique 
region, 80 putative 
ORFs and kaposin 
gene, complete cds; OR 


3.40E-10 


248 


Z83822 


Human DNA 
sequence from 
PAC 306D1 on 
chromosome X 
contains ESTs. 


0.025 


GRSB BACB 
R 


GRAMICIDIN S 
SYNTHETASE II 
(GRAMICIDIN S 
BIOSYNTHESIS 
GRSB PROTEIN) (EC 
6.-.-.-) 


1 


249 


Z94161 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
N102C10; HTGS 


0.025 


SI 6323 


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHBl_l 
A;thaliana homeobox 
gene Athb-1 mRNA; 
Open reading frame 


0.0079 ! 
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ACCESSION 


DESCRIPTION 


P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 






phase 1. 










250 


AC002094 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


0.021 


S57447 


HPBRII-7 protein - 
human>GP:HSHPBRII 
4 1 H;sapiens HPBRII- 
4 

mRNA>GP:HSHPBRII 
7 J H;sapiens HPBRII- 
7 gene 




251 


D79994 


Human mRNA 
for KIAA0172 
gene, partial cds. 


0.021 


CER10H10_1 


Caenorhabditis elegans 
cosmid R10H10, 
complete sequence; 
R11A8;7; Protein 
predicted using 
Genefinder; Similarity 
to Mouse ankyrin (P1R 
Acc; No; S37771); 
cDNA EST CEESX25F 
comes from this gene; 


7.00E-16 


252 


Z97635 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
438L4; HTGS 
phase 1. 


0.017 


CELW05H7_4 


Caenorhabditis elegans 
cosmid W05H7 




253 


X84996 


X.laevis mRNA 
for selenocysteme 
tRNA acting 
factor (Staf). 


0.017 


JN0786 


integrin beta-4 chain 
precursor - mouse 


0.088 


254 


AC002543 


Human BAC 
clone RG300C03 
from 7q31.2, 
complete 
sequence. 


0.013 


MZLMTCYT 
BT_1 


Mendozellus isis 
mitochondrial NADH 
dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RNA- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.044 
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P 
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255 


U10401 


Caenorhabditis 
elegans cosmid 
T20B12. 


0.012 


MMMHC29N 
7_2 


Mus musculus major 
histocompatibility 
locus class III 
region:butyrophilin-like 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 
alpha, palmitoyl- 


0.069 


256 


LI 4593 


Saccharomyces 
cerevisiae protein 
phosphatase 
(PTC 1) gene, 
complete cds. 


0.011 


D86995J 


Human (gene 1) DNA 
for phosphatase 2C 
motif, partial cds 


2.20E-14 


257 


U62317 


Chromosome 
22ql3 BAC 
Clone 

CIT987SK- 
3o4JJo complete 
sequence. 


0.0093 


P2Y8 XENL 
A 


P2Y PURINOCEPTOR 
8 

(P2Y8)>GP:XLP2Y8_ 
1 X;laevis mRNA for 
P2Y8 nucleotide 
receptor 


0.89 


258 


D29655 


Pig mRNA for 
UMP-CMP 
kinase, complete 
cds. 


0.0075 


AF004858J 


Mus musculus platelet 
activating factor 
receptor mRNA, partial 
cds; PAF-receptor 


1 


259 


Ah 002992 


Homo sapiens 
cosmid from 
Xq28, complete 
sequence. 


0.0054 


FBN1 BO VI 
N 


FIBRILLIN 1 
PRJECURSOR>PIR2:A 
55567 fibrillin I - 
bovine>GP:BOVXAA 
AA I Bos taurus 
mRNA, complete cds; 
Putative 


0.0004 


260 


B20752 


T19M2-T7 
TAMU 
Arabidopsis 
thaliana genomic 
clone T19M2. 


0.0043 


HSVT1IEPJ 


Feline herpesvirus type 
1 gene for immediate 
early protein, complete 
cds; Feline herpesvirus 
type 1 immediate early 
protein 


3.90E-05 
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ID 
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DESCRIPTION 


P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 


261 


AB006699 


Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
PI clone: MDJ22. 


0.0037 


YHV5 YEAS 
T 


HYPOTHETICAL 
143.6 KD PROTEIN 
IN SPO16-REC104 
INTERGENIC 
REGION>PIR2:S4675 
4 hypothetical protein 
YHR155W- yeast 
(Saccharomyces 
cerevisiaej-^Oi in . y 
H9666_15 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9666; 
Yhrl 55wp; Similar to 
Sip3p (Snf 


0.077 


262 


Z99128 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
422H 1 1 ; H 1 Lib 
phase 1. 


U.UU32 


at 1 11 UI TN/f 

ALU I MUlvi 

AN 


f f 1 1 AT IT 

SUBFAMILY J 
WARNING ENTRY 
!!!! 


0 0087 

\J .\J\J o / 


263 


B21848 


T2D2-Sp6 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2D2. 


0.0031 


B31794 


mdm- 1 protein (clone 
cl03) - mouse 


l.OOE-05 


264 


L33853 


Human germline 
immunoglobulin 
kappa chain 
variable region 
(Vk-IV subgroup) 
for anti-B- 
amyloid 

autoantibodies in 

Alzheimer's 

disease. 


0.0027 


B45550 


cytochrome b homolog 
- Plasmodium yoelii 


0.99 


265 


B36863 


HS-1042-A1- 
F01-MR abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 824 
Col=l Row=K. 


0.0027 


YQK4_CAEE 
L 


HYPOTHETICAL 64.3 
KD PROTEIN 
C56G2.4 IN 
CHROMOSOME 
1II>GP:CELC56G2_2 
Caenorhabditis elegans 
cosmid C56G2 


0.81 
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DESCRIPTION 


P 
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P 

VALUE 


266 


AC003041 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HCIT307A16; 
HTGS phase 1, 
10 unordered 
pieces. 


0.0024 


GLB4 LAMS 
P 


GIANT 

HEMOGLOBIN AIV 
CHAIN 

(FRAGMENT)>PIR2: 
S01 810 hemoglobin 
AIV - tube worm 
(Lamellibrachia sp.) 
(fragment) 


0.94 


j 267 


AC002315 


Mouse BAC- 
146N21 

Chromosome X 

contains 

iduronate-2- 

sulfatase gene; 

complete 

sequence. 


0.0022 


MG42 TARM 
A 


SRY-RELATED 
PROTEIN MG42 
(FRAGMENT)>PIR3:I 
51369 Sry-related 
sequence - Tarentola 
mauritanica 
(fragment)>GP:TELM 
G42DNA1 Gecko 
MG42 gene, partial 
cds; Sry-related 
sequence 


0.99 


268 


AFO 16674 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


SCYJL204C 
1 


S;cerevisiae 
chromosome X reading 
frame ORF YJL204c 


1 


269 


AFO 16674 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


CEM199_3 


Caenorhabditis elegans 
cosmid Ml 99, 
complete sequence; 
M199;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97 


270 


AFO 16674 j 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


CEM199_3 


Caenorhabditis elegans 
cosmid Ml 99, 
complete sequence; 
M199;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97 


271 


Z54199 


L.esculentum 
DNA Ailsa craig 
encoding 1- 
aminocyclopropa 
ne- 1 -carboxylic 
acid oxidase. 


0.0015 I 


CELF20A1__5 


Caenorhabditis elegans 
cosmid F20A1; Coded 
for by C; elegans 
cDNA yk9gl;3; coded 
for by C; elegans 
cDNA yk9gl;5; coded 
for by C; elegans 
cDNA CEESU55F; 


0.11 
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weak similarity to 
putative 




272 


Z99943 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
313L4; HTGS 
phase 1. 


0.0014 


CEK08F8_5 


Caenorhabditis elegans 
cosmid K08F8, 
complete sequence; 
K08F8;5b 


0.93 


273 


S81083 


beta- 

ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton 
protein, beta - 
ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton protein 
{alternatively 
spliced, exon 10 
to 13 region} 
[human, 
Genomic, 1851 
nt, segment 3 of 
3]. 


0.0013 


MTCY277_7 


Mycobacterium 
tuberculosis cosmid 
Y277; Unknown; 
MTCY277;07c, 
unknown, len: 302 


0.0001 


i 274 


Z82174 


Human DNA 
sequence irorn 
cosmid B20F6 on 
chromosome 
22ql 1.2-qter. 


0.001 


FBLA_HUM 
AN 


FIBULIN-1, 
ISOFORM A 
PRECURSOR>GP:HS 
FIBUA 1 H;sapiens 
mRNA for fibulin-1 A 


0.00063 


275 


Z82215 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6802; HTGS 
phase 1 . 


0.00079 


BFR1 SCHP 
O 


BREFELDIN A 
RESISTANCE 
PROTEIN>PIR2:S522 
39 hba2 protein - 
fission yeast 
(Schizosaccharomyces 
pombe)>GP:SPHBA2 
GEN_1 S;pombe hba2 
gene 


0.15 
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276 


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.00071 


CX2HEMHA 


CYTOTOXIN 2 
(TOXIN 12A) 


0.32 


277 


Z82204 


Human DNA 
sequence from 
clone J362G171. 


0.00054 


DMU34925_2 


Drosophila 
melanogaster DNA 
repair protein (mei-41 ) 
gene, complete cds, and 
TH 1 gene, partial cds 


0.045 


278 


AC002530 


Human BAC 
clone RG341D10 
from 7pl5-p21, 
complete 
sequence. 


0.00053 


CELT28F2_2 


Caenorhabditis elegans 
cosmid T28F2; Weak 
similarity to HSP90 


0.037 


279 


U91322 


Human 

chromosome 

16pl3 BAC clone 

CIT987SK-276F8 

complete 

sequence. 


0.00051 


CEW08D2_2 


Caenorhabditis elegans 
cosmid W08D2, 
complete sequence; 
W08D2;3; Protein 
predicted using 
Genefinder>GP:CEW0 
8D2_2 Caenorhabditis 
elegans cosmid 
W08D2; W08D2;3; 
Protein predicted using 
Genefinder 


0.26 


280 


D 16986 


Human HepG2 
partial cDNA, 
clone 

hmd2b09m5. 


0.00037 


POLG PPVN 
A 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 KD 
PROTEIN; NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


0.48 


281 


U91318 


Human 
chromosome 
16pl3 BAC clone 
CIT987SK- 
962B4 complete 


0.00031 


<NONE> 


<NONE> 


<NONE> 
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ACCESSION 
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P 
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sequence. 










282 


M93406 


Human dispersed 
Alu repeats and 
dispersed LI 
repeat. 


0.0003 


VG8_SPV4 


GENE 8 

PROTEIN>PIRl:G8BP 
SV gene 8 protein - 
spiroplasma virus 4 
(SGC3) 


0.23 


283 


AC002398 


Human DNA 
from | 
chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, 
complete 
sequence. 


0.00021 


HMCA DRO 
ME 


HOMEOTIC 
CAUDAL 

PROTEIN>PIR2:A263 
57 homeotic protein 
Cad - fruit fly 
(Drosophila 
melanogaster)>GP:DR 
OCADA2J 
D;melanogaster caudal 
gene (cad) encoding a 
maternal and zygotic 
transcript, exon 2; 
Caudal 

protein>TFD:TFDP001 
59 - Polypeptides en 


0.021 


284 


AC002530 


Human BAC 
clone RG341D10 
from 7pl5-p21, 
complete 
sequence. 


0.0002 ; 


PL0009 


complement 
C3d/Epstein-Barr virus 
receptor precursor - 
human 


0.7 


285 


X01871 


Yeast 

mitochondrial 
ori(o) repeat unit 
of petite mutant 5 
(petite strain s- 

i a in /o \ 
10/ IIZ). 


0.00015 


RVZMTCYT 
BTJ 


Reventazonia sp; 
mitochondrial NADH 
dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RNA- 

Cpr of»np* Thi^ rodps 

for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.73 


286 


U89984 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds. 


0.00015 


ACU89984_1 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds; Similar 
to human 
transformation- 


4.20E-13 
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sensitive protein: 
SwissProt Accession 
Number P31948 




287 


AC002365 


Homo sapiens 
chromosome X 
clone U177G4, 
U152H5, 
U168D5, 174A6, 
U172D6, and 
U186B3 from 
Xp22, complete 
sequence. 


0.00011 


S 10340 


DNA-directed RNA 
polymerase (EC 
2.7.7.6) - yeast 
(Kluyveromyces 
marxianus var. lactis) 


0.00062 


288 


AC002390 


Human DNA 
from overlapping 
chromosome 19- 
specific cosmids 
R30072 and 
R28588, genomic 
sequence, 
complete 
sequence. 


9.90E-05 


D86603_l 


Mouse mRNA for Bach 
protein 1, complete cds; 
Bachl 


1 


289 


AC002980 


Homo sapiens; 
HTGS phase 1 , 
34 unordered 
pieces. 


9.20E-05 


TRBKPCYB_ 
1 


Trypanosoma brucei 
kinetoplast 

apocytochrome b gene, 
complete cds 


0.52 


290 


M99412 


Human 
interleukin-8 
receptor (IL8RB) 
gene, complete 
cds. 


4.50E-05 


S28832 


microtubule-associated 
protein HI (clone 
KS3.1) - longfm squid 
(fragment) 


0.88 


291 


AC000120 


Human BAC 
clone RG161K23 
from 7q21, 
complete 
sequence. 


4.00E-05 


SXSCRBAJ 


S;xylosus scrB and 
scrR genes; Sucrose 
repressor 


0.99 
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292 


AC003037 


Homo sapiens; 
HTGS phase 1, 
66 unordered 
pieces. 


3.40E-05 


SI 3569 


hypothetical protein 5 - 
Lactococcus lactis 
subsp. lactis insertion 
sequence 

1076>GP:LLTLE_1 
Lactococcus lactis 
DNA for the 
transposon-like element 
on the lactose plasmid; 
ORF5 (AA 1 - 43) 


0.018 


293 


Z81512 


Caenorhabditis 
elegans cosmid 
F25C8, complete 
sequence. 


2.40E-05 


MUSDBPRC 
1 


Mus musculus DNA- 
binding protein Rc 
mRNA, complete cds; 
DNA binding protein 
Rc 


1 


294 | 


B16681 


343C3.TVB 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
343C03. 


1.10E-05 


COPP YEAS 
T 


COATOMER BETA* 
SUBUNIT (BETA- 
COAT PROTEIN) 
(BETA'- 

COP)>PIR2:B55123 
coatomer complex beta' 
chain - yeast 
(Saccharomyces 
cerevisiae)>GPN:SCY 
GL137WJ 
S;cerevisiae 
chromosome VII 
reading frame ORF 

237_1 Saccharomyces 
cerevisiae 


0.081 


295 


Z 16523 


H. sapiens 
(D9S158)DNA 

containing (CA) 
repeat; clone 
AFM073ybll; 
single read. 


1.00E-05 


MMSEMFJ 


M;musculus mRNA for 
semaphorin F; 
Smaphorin F 


0 78 


296 


Z49704 


Sxerevisiae 
chromosome XIII 
cosmid 8021. 


5.60E-06 


<NONE> 


<NONE> 


<NONE> 


297 


AC003071 


Human BAC 
clone BK085E05 
from 22ql2.1- 
qter, complete 
sequence. 


3.00E-06 


HSRCAERJ 


H;sapiens mRNA for 
red cell anion 
exchanger (EPB3, AE1, 
Band 3) 3' non-coding 
region 


0.21 
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298 


U20428 


Human SNC19 
mRNA sequence. 


1 .40E-06 


HUMMUC2A 
J 


Human mucin-2 gene, 
partial cds 


4.40E-06 


299 


U51903 


Human RasGAP- 
related protein 
(IQGAP2) 
mRNA, complete 
cds. 


6.60E-07 


IQGA HUMA 
N 


RAS GTPASE- 
ACTIVA TING-LIKE 
PROTEIN IQGAP1 
(P195)>PIR2:A54854 
Ras GTPase activating- 
related protein - 
human>GP:HUMIQG 
Al Homo sapiens ras 
GTPase-activating-like 
protein (IQGAP1) 
mRNA, complete cds; 
Amino acid feature: IQ 
calmodulin-binding do 


1.60E-14 


300 


AL000805 


F.rubripes GSS 
sequence, clone 
021G08aAl. 


4.70E-07 


MT13 MYTE 
D 


METALLOTHIONEIN 
10-111 (MT-10- 
III)>PIR2:S39418 
metallothionein 10-111 - 
blue mussel 


2.20E-10 


301 


AC003016 


Human BAC 
clone RG134C19 
from 8q21, 
complete 
sequence. 


4.30E-07 


SPC57A10_5 


S;pombe chromosome I 
cosmid c57A10; 
Unknown; 
SPAC57A10;05;c, 
unknown, len:606aa, 
similar to A; nidulans 
Q00659, sulfur 
metabolite repression 
control, (678aa), fasta 
scores, opt: 1355, 


0.00041 


302 


AC003089 


Human BAC 
clone 

RG180F08A, 

complete 

sequence. 


3.80E-07 


HPBPRECK 
1 


Hepatitis B virus type 
1 1 precore protein (pre- 
C region, C) gene, 5' 
end 


0.41 ; 


303 


AC002074 


Human BAC 
clone GS056H 18 
from 7q31-q32, 
complete 
sequence. 


2.40E-07 


A47021J 


Sequence 23 from 
Patent W09527787; 
Unnamed protein 
product; Author-given 
protein sequence is in 
conflict with the 
conceptual 

translation>GP:A5 1 260 

1 Sequence 23 from 
Patent W09614416; 
Unnamed protein 
product; Author-given 


0.0016 
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protein sequence is i 




304 


U04980 


Rattus norvegicus 
fetal troponin T 3 
(fetal TnT3) 
mRNA, partial 
cds. 


2.20E-07 


HUMFSHDl 


Human 

facioscapulohumeral 
muscular dystrophy 
(FSHD) gene region, 
D4Z4 tandem repeat 
unit; ORF 


3.30E-08 


305 


U68704 


Human 
chromosome 
21q22.3 PI -clone 
3804 subclone 4- 
52. 


2.00E-07 


HHV6AGNM 
_96 


Human herpesvirus-6 
(HHV-6)U1102, 
variant A, complete 
virion genome; U88; 
Cys repeats; this loci is 
open in all six reading 
frames, part of IE- A 


1 70F-05 


306 


U51583 


Rattus norvegicus 
zinc finger 
homeodomain 
enhancer-binding 
protein- 1 (Zfhep- 
1) mRNA, partial 
cds. 


8.70E-08 


AF005370_67 


Alcelaphine 
herpesvirus 1 L-DNA, 
complete sequence; 
Putative immediate 
early protein; ORF73; 
similar to H; saimiri 
and KSHV ORF73 


6 10E-07 


307 


M80206 


Mus domesticus 
poliovirus 
receptor homolog 
(MrHJ mKJN A, 
complete cds. 


8.10E-08 


153960 


PRR2 alpha - human 


1 .70E-28 


308 


M60854 


Human ribosomal 
protein S 1 6 
mRNA, complete 
cds. 


5.70E-08 


OLVPOLJ 


Caprine arthritis 
encephalitis virus 
(isolate OVLV-Nl)pol 
protein gene, 3 ! end of 
cds; Nt 2497-2695 
from CAEV Co 


0.27 


309 


U82828 


Homo sapiens 
ataxia 

telangiectasia 
(ATM) gene, 
complete cds. 


1.50E-08 


C40201 


artifact-warning 
sequence (translated 
ALU class C) - human 


0.00044 
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310 


Z83836 


Human DNA 
sequence from 
PAC 111J24 on 
chromosome 
22ql2-qter 
contains ESTs. 


1 .40E-08 


HSU64473J 


Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial 
cds>GP:HSU64498J 
Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial cds 


0.34 


311 


Z50029 


Caenorhabditis 
elegans cosmid 
ZC504, complete 
sequence. 


1.40E-08 


MMU88984J 


Mus musculus NIK 
mRNA, complete cds 


1 .70E-50 


312 


AC002351 


Homo sapiens; 
HTGS phase 1, 
1 7 unordered 
pieces. 


1.20E-08 


D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp.>GP:HSNCOL4 1 
Hydra N-COL 4 
mRNA for mini- 
collagen; No start 
codon 


0.02 


313 


B65763 


CIT-HSP- 
2023A12.TR 
CIT-HSP Homo 
sapiens genomic 
clone 2023A12. 


3.60E-09 


SI8106 


type II site-specific 
deoxyribonuclease (EC 
3.1.21.4) Abrl- 
Azospirillum brasilense 


0.045 


314 


Z93021 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
516C23;HTGS 
phase 1. 


2.00E-09 


AB001684 13 
4 


Chlorella vulgaris C-27 
chloroplast DNA, 
complete sequence; 
RNA polymerase 
gamma subunit 


0.6 


315 


D88035 


Rat mRNA for 
glycoprotein 
specific UDP- 
glucuronyltransfe 
rase, complete 
cds. 


1.50E-09 


D88035_l 


Rat mRNA for 
glycoprotein specific 
UDP- 

glucuronyltransferase, 
complete cds 


l.OOE-33 
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316 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds. 


1.30E-10 


VGF1JBVB 


Fl 

PROTEIN>PIRl:VFIH 
Bl Fl protein - avian 
infectious bronchitis 
virus (strain 
Beaudette)>GP:IBACG 
B l Avian infectious 
bronchitis virus pol 
protein, spike protein, 
small virion-associated 
protein, membrane 
protein, and 
nucleocapsid protein 
gen 


1 


317 


B04719 


cSRL-42G12-u 
cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
CSRL-42G12. 


7.90E-11 


JC5238 


galactosylceramide-like 
protein, GCP - human 




3 1 o 


M73506 


Mouse Tcp- 1 0c (t 
allele) gene. 


2.80E-11 


A39487 


T-complex protein 1 Oa 
(allele \Zy) - mouse 


4.10E-16 


319 


U71148 


Human Xq28 
cosmids U225B5 
and U236A12, 
complete 
sequence- 


1.20E-11 


A56547 


sex-peptide precursor - 
Drosophila suzukii 


0.4 


320 


Z95116 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57G9; HTGS 
phase 1 . 


9.90E-13 


ALU2 HUM 
AN 


!!!! ALU 

SUBFAMILY SB 
WARNING ENTRY 
!!!! 


0 0017 1 


321 


i M64795 


Rat MHC class 1 
antigen gene 
(RTl-u 
haplotype), 
complete cds. 


1.70E-14 


STC_DROME 


SHUTTLE CRAFT 
PROTE1N>GP:DMUO 
9306_1 Drosophila 
melanogaster shuttle 
craft protein (stc) 
mRNA, complete cds; 
C-terminal 222 amino 
acids encode a novel 
single- stranded DNA 
binding domain 


1.40E-I3 
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322 


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


4.20E-15 


AF010403_1 


Homo sapiens ALR 
mRNA, complete cds; 
Alternatively spliced; 
similarity to ALL-1 and 
Drosophila trithorax 


1 


323 


U 12523 


Rattus norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence. 


2.90E-15 


SPBC30D10 
4 


S;pombe chromosome 
II cosmid c30D10; 
Hypothetical protein; 
SPBC30D10;04, 
unknown, len:148aa 


2.40E-09 


324 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18; HTGS 
phase 1. 


2.20E-15 


RPON HAL 
MA 


DNA-DIRECTED 
RNA POLYMERASE 
SUBUNIT N (EC 
2.7.7.6)>PIR2:D41715 
DNA-directed RNA 
polymerase II chain 
RPB10 homolog- 
Haloarcula 

marismortui>GP:HAL 
HMAENOA_4 
H;marismortui tRNA- 
Leu, HL29, HmaL13, 
HmaS9, OrfMMV, 
OrfMNA, 2- 
phosphoglycerate 
dehydr 


0.019 


325 1 


M86917 


Human oxysterol- 
binding protein 
(OSBP) mRNA, 
complete cds. 


1.60E-15 


CEF14H8_2 


Caenorhabditis elegans 
cosmid F14H8, 
complete sequence; 
F14H8;1; Similarity to 
Human oxysterol- 
binding protein 
(SW:OXYB_HUMAN) 


2.10E-18 


326 


AC001231 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


1.30E-15 


AC002397_3 


Mouse BAC284H12 
Chromosome 6, 
complete sequence; 
DRPLA 


0.0016 


327 


AL008626 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
1114G22;HTGS 
phase 1. 


5.30E-16 


TAU48227J 


Triticum aestivum 
soluble starch synthase 
mRNA, partial cds 


5.90E-05 
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328 


L04483 


Human ribosomal 
protein S21 
(RPS21)mRNA, 
complete cds. 


7.60E-17 


RS21 HUMA 
N 


40S RIBOSOMAL 
PROTEIN 
S21>PIR2:S34108 
ribosomal protein S21 - 
human>GP:SSZ84015_ 
1 S;scrofa mRNA; 
expressed sequence tag 
(3'; clone cllglO); 40S 
ribosomal protein S21; 
Similar to human 40S 
ribosomal protein 
S21>GP:HUMRPS21X 
1 Human ribosomal 


1 .40E-09 


329 


AB001899 


Homo sapiens 
PACE4 gene, 
exon 2. 


6.70E-17 


LRP1 HUMA 
N 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 

1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) (A2MR) 
(APOLIPOPROTEIN 
E RECEPTOR) 
(APOER)>PIR2:S0239 

2 LDL receptor-related 
protein precursor - 
human>GP:HSLDLRR 
L_l Human mRNA for 
LDL-recept 


1 


330 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN rKUOKbbb 
*** from clone 
76C18; HTGS 
phase 1 . 


4.40E-17 


U97553_59 


Murine herpesvirus 68 
strain WUMS, 
complete genome; 

ix 1 Dun UL 1 C\J 1 1 U C 

reductase large 


0.06 


331 


AF017187 


Homo sapiens 
LTR HERV-K 
repetitive element 
fragment 
ltr_19_9a 
sequence. 


3.90E-18 


D84255J 


Ovophis okinavensis 
mitochondrial DNA for 
NADH dehydrogenase 
subunit 1 , partial cds, 
lle-tRNA, Pro-tRNA, 
Phe-tRNA, Gin- tRNA, 
Met-tRNA and control 
region (D-loop region); 
This cds 


0.007 
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P 
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P 
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332 


B36252 


HS-1038-A2- 
GOl-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
PIate=CT 820 
Col=2 Row=M. 


3.10E-18 


PGBM MOU 
SE 


BASEMENT 
MEMBRANE- 
SPECIFIC HEPARAN 
SULFATE 
PROTEOGLYCAN 
CORE PROTEIN 
PRECURSOR (HSPG) 
(PERLECAN) 
(PLC)>PIR2:S 18252 
heparan sulfate 
proteoglycan - 
mouse>GP:MUSPERP 
A_l Mouse perlecan 
mRNA, complete cds 


0.00015 


333 


D78255 


Mouse mRNA for 
PAP-1, complete 
cds. 


2.70E-18 


MUSPAP1J 


Mouse mRNA for 
PAP-1, complete cds 


3.50E-18 


334 


AC003046 


Human Xp22 
PACs RPC1I- 
263P4 and 
RPC11-164K3 
complete 
sequence. 


1.40E-18 


CEC34F6J 


Caenorhabditis eiegans 
cosmid C34F6; 
C34F6;1;CDNA EST 
yk46bl2;5 comes from 
this gene; cDNA EST 
yk44c4;5 comes from 
this gene; cDNA EST 
yk46b!2;3 comes from 
this gene 


0.0015 


335 


AC003002 


Human DNA 
from overlapping 
chromosome 19- 
specific cosmids 
R29515and 
R28253, genomic 
sequence, 
complete 
sequence. 


1.40E-18 


MUSZFPOJ 


Mouse mRNA for zinc 
finger protein, partial 
sequence 


1.30E-19 


336 


Y 15054 


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


3.40E-19 


HS4U2IR2J 


Epstein-Barr virus 
(AG876 isolate) U2- 
IR2 domain encoding 
nuclear protein 
EBNA2, complete cds; 
Nuclear antigen 2 


2.00E-06 


337 


Z97876 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
295C6; HTGS 


1.30E-19 


AF003535J 


Homo sapiens LI 
element ORF2-like 
protein gene, partial cds 


7.00E-05 
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Genbank) 


Nearest 
Neighbor 
(BKastX vs. 
Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 






phase 1. 










338 


M97159 


Mouse (clone 
pIL2)Bl 
dispersed repeat 
unit. 


1.10E-19 


A26882 


pIL2 hypothetical 
protein - rat 
(fragment)>GP:RATT 
DR_1 Rat growth and 
transformation- 
dependent mRNA, 3' 
end; Growth and 
transformation 
dependent protein 


0.2 ! 


339 


U30817 


Bos taurus very- 
long-chain acyl- 
CoA 

dehydrogenase 
mRNA, nuclear 
gene encoding 
mitochondrial 
protein, complete 
cds. 


4.70E-20 


ACDVRAT 


ACYL-COA 

DEHYDROGENASE, 

VERY-LONG-CHAIN 

SPECIFIC 

PRECURSOR (EC 

1.3.99.-) 

(VLCAD)>PIR2:A548 
72 acyl-CoA 
dehydrogenase (EC 
1.3.99.-) very-long- 
chain-specific 
precursor - 

rat>GP:RATVLCAD_ 
1 Rat mRNA for very- 
long-chain Acyl-CoA 
dehydrogenase, compl 


8.10E-25 


340 


Y11535 


H.sapiens mRNA 
for SHOXb 
protein. 


2.80E-20 


ALU1 HUM 
AN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


0.00027 




ALUUo /3U 


Human din a 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
487J7; HTGS 
phase 1. 


"7 1 HP 1 1 




q *-f \'f cxft t*ti i n a 
aiU lal/l- WclT lllllg 

sequence (translated 
ALU class C) - human 


0 001 


342 


U96629 


Human 

chromosome 8 
BAC clone 
CIT987SK-2A8 
complete 
sequence. 


5.30E-23 


ALU! HUM 
AN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


3.80E-10 
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ID 
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P 
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P 
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343 


U95743 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


2.10E-24 


UROM HUM 
AN 


UROMODULIN 
PRECURSOR 
(TAMM-HORSFALL 
URINARY 
GLYCOPROTEIN) 
(THP)>PIR2:A30452 
uromodulin precursor - 
human>GP:HUMUMO 
D_l Human 
uromodulin (Tamm- 
Horsfall glycoprotein) 
mRNA, complete cds; 
Uromodulin precursor 


1 


344 


U 1 5972 


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


S20790 


extensin - 

almond>GP:PAEXTS_ 
1 P;amygdalus mRNA 
for extensin 


0.34 


345 


U 15972 


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


CA24 CAEE 
L 


COLLAGEN ALPHA 
2(IV) CHAIN 
PRECURSOR>GP:CE 
COLA2I V_2 C;elegans 
a2(IV) collagen gene; 
Alternatively spliced 
transcript 


0.1 


346 j 


Z66242 


H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
84a4, reverse read 
cpg84a4.rtla. 


4.80E-26 


CEC35A5_8 


Caenorhabditis elegans 
cosmid C35A5, 
complete sequence; 
C35A5;8; CDNA EST 
yk3 1 f6;5 comes from 
this gene; cDNA EST 
yk38hl;3 comes from 
this gene; cDNA EST 
yk38hl;5 comes from 
this gene; 


7.70E-19 


347 


L25331 


Rattus norvegicus 
lysyl hydroxylase 
mRNA, complete 
cds. 


3.90E-26 


LYSH CHIC 
K 


PROCOLLAGEN- 
LYSINE,2- 
OXOGLUTARATE 5- 
DIOXYGENASE 
PRECURSOR (EC 
1.14.11.4) (LYSYL 
HYDROXYLASE)>PI 
R2:A23742 
procollagen-lysine 5- 
di oxygenase (EC 
1.14.1 1.4) precursor - 
chicken>GP:CHKLYH 
1 Chicken lysyl 


1.10E-43 
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(BlastX vs. 
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Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












hydroxylase mRNA, 
complete cds 




348 1 


L81569 


Drosophila 
melanogaster 
(subclone 2 d7 
from PI DS04260 
(D68)) DNA 
sequence, 
complete 
sequence. 


3.30E-26 


CELC52B9_2 


Caenorhabditis elegans 
cosmid C52B9; Coded 
for by C; elegans 
cDNA cml ld6; weakly 
similar to S; cervisiae 
PTM1 precursor 
^or.rj^oj / ) 


8.40E-29 


349 


U78082 


Human RNA 
polymerase 
transcriptional 
regulation 
mediator (h- 
MED6) mRNA, 
complete cds. 


2.30E-26 


HSU78082J 


Human RNA 
polymerase 
transcriptional 
regulation mediator (h- 
MED6) mRNA, 
complete cds; H- 
Med6p 


1 .50E-16 


350 


U43381 


Human Down 
Syndrome region 
of chromosome 
21 DNA. 


2.10E-28 


HSMRNAEB 
1 


H;sapiens genomic 
DNA, integration site 
for Epstein-Barr virus; 
Hypothetical protein 


0.18 


351 


D50416 


Mouse mRNA tor 
AREC3, 
complete cds. 






prOSlaglailUlll"" 

endoperoxide synthase 
(EC 1.14.99.1) 
precursor - 

sheep>GP:SHPCOXA_ 
1 Sheep prostaglandin 
endoperoxide 
synthetase 
(cyclooxygenase), 
complete cds; 
Cyclooxygenase 
precursor (EC 
1;14;99;1) 


0.81 
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P 
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ACCESSION 
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P 
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352 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds. 


2.20E-29 


CFU30222J 


Crithidia fasciculata 
fully edited ATPase 
subunit 6 (MURF4) 
mRNA, partial cds; 
Cryptogene 


0.53 


353 


Z92826 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
C18D11;HTGS 
phase 1. 


1.10E-30 


SPAC1B3_5 


S;pombe chromosome I 
cosmid clB3; 
Hypothetical protein; 
SPAC1B3;05, probable 
transcriptional 
regulator, len:630aa, 
similar eg; to YIL038C, 
NOT3YEAST, 
P06102, general 
negative regulator, 


3.20E-35 


354 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


3.70E-32 


PVU72769J 


Phaseolus vulgaris 
PvPRP-12 (Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich 

protein>GP:PVU72769 

1 Phaseolus vulgaris 
PvPRP- 12 (Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich protein 


0.00049 


355 


B42455 


HS-1055-B2- 
G03-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 777 
Co!=6 Row=N. 


1.30E-32 


CELT05H4_8 


Caenorhabditis elegans 
cosmid T05H4; Similar 
to the beta transducin 
family; coded for by C; 
elegans cDNA 
ykl56el 1;3; coded for 
by C; elegans cDNA 
ykl4c8;3; coded for by 
C; elegans cDNA 


6.90E-14 


356 


AF001905 


Homo sapiens 
cosmids E079, 
B0920 and A8 
from Xq25 X- 
linked 

lymphoproliferati 
ve disease gene 
candidate region, 
complete 
sequence. 


L80E-33 


138344 


titin - human 


1 
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SEQ 
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ACCESSION 
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P 

VALUE 


357 


E03743 


DNA sequence 
including male 
hormone 
dependent gene 
derived from 
hamster 
frankorgan. 


1.10E-34 


CELC03A7_2 


Caenorhabditis elegans 
cosmid C03A7; Weak 
similarity to serotonin 
receptors 


0 59 


358 


U31199 


Human laminin 
gamma2 chain 
gene (LAMC2), 
exon 22 and 
flanking 
sequences. 


1.20E-35 


B44018 


laminin B2t chain - 
human>GP:HSLAMB2 
TB_1 H;sapiens mRNA 
for laminin 


1.20E-14 


359 


D14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds. 


2.00E-36 


D49544_l 


Mouse mRNA for 
KIFC1, complete cds 


1.20E-23 


360 


AB000425 


Porcine DNA for 
endopeptidase 
24.16, exon 16 
and complete cds. 


8.20E-38 


POL4 DROM 

E 


RETROVIRUS- 

RELATED POL 

POLYPROTEIN 

(PROTEASE (EC 

3.4.23.-); REVERSE 

TRANSCRIPTASE 

(EC 2.7.7.49); 

ENDONUCLEASE) 

(TRANSPOSON 

412)>PIR1:GNFF42 

reirovirus-reiaieu poi 

polyprotein - fruit fly 

(Drosophila 

melanogaster) 

transposon 

412>GP:DMRT412G_ 
4 


0.65 


361 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 


8.80E-42 


156333 


apolipoprotein B - rat 
(fragment)>GP:RATA 
POLPB_l Rattus 
norvegicus (clone 
rb9E) apolipoprotein B 
apoB mRNA, 3' end 


0.23 
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Nearest 
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Gen bank) 


Nearest 
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(BlastX vs. 
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SEQ 
ID 
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P 
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ACCESSION 
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P 
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362 


L09647 


Rattus norvegicus 
hepatocyte 
nuclear factor 3a 
(HNF-3 beta) 
mRNA, complete 
cds. 


6.60E-42 


HN3BRAT 


HEPATOCYTE 
NUCLEAR FACTOR 
3-BETA (HNF- 
3B)>GP:RATHNF3B_ 
1 Rattus norvegicus 
hepatocyte nuclear 
factor 3a (HNF-3 beta) 
mRNA, complete 
cds>TFD:TFDP01611 
- Polypeptides entry for 
factor HNF-3 (beta) 


8.10E-25 


363 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds. 


4.10E-43 


CELC34D4 1 
2 


Caenorhabditis elegans 
cosmid C34D4 


0.018 


364 


Z56764 


H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
13f7, reverse read 
cpgl3f7.rtla. 


1.40E-43 


S75263 


hypothetical protein - 
Synechocystis sp. (PCC 
6803)>GP:D90904_29 
Synechocystis sp; 
PCC6803 complete 
genome, 6/27, 630555- 
781448; Hypothetical 
protein; 

ORFJD:sll0983 


0.0028 


365 1 


AC002636 


*** 

SEQUENCING 
IN PROGRESS 
*** Drosophila 
melanogaster 
(subclone 2_g4 
from PI DS03323 
(D 127)) DNA 
sequence; HTGS 
phase 2. 


8.40E-44 


DMU9S760J 


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


3.40E-51 


366 


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds. 


8.00E-44 


GLSLRAT 


GLUTAMINASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) 

(GLS)>GP:RATGAH_ 
1 Rattus norvegicus L- 
glutamine 
amidohydrolase 
mRNA, complete cds 


8.00E-29 
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367 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds. 


5.00E-45 


DMU95760J 


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


4.80E-45 


368 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


4.10E-45 


PTPK HUMA 
N 


PROTEIN-TYROSJNE 
PHOSPHATASE 
KAPPA PRECURSOR 
(EC 3. 1.3. 48) (R-PTP- 
KAPPA)>GP:HSPTPK 
AP_1 H;sapiens mRNA 
for phosphotyrosine 
phosphatase kappa; 
Human 

phosphotyrosine 
phosphatase kappa 


4.70E-16 


369 


D17218 


Human HepG2 3' 
region Mbol 
cDNA, clone 
hmd3g02m3. 


9.40E-47 


MMU53563_1 


Mus musculus Brgl 
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00012 


370 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds. 


8.10E-48 


HSU78310J 


Homo sapiens 
pescadillo mRNA, 
complete cds 


1.10E-21 


371 


AC000399 


Genomic 
sequence from 
Mouse 9, 
complete 
sequence. 


7.40E-48 


KIP2_YEAST 


KINESIN-LIKE 
PROTEIN 
KIP2>PIR1:C42640 
kinesin-related protein 
KIP2 - yeast 
(Saccharomyces 
cerevisiae)>GP:SCKIP 
2XVI 2 S;cerevisiae 
PEP4 and K1P2 genes 
encoding PEP4 
proteinase (partial) and 
kinesin-related protein 
KIP2>GP:SCLACHX 
VI_17 S;cerev 


0.14 


372 


AC002327 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Mouse 7; HTGS 
phase 1, 3 


1.40E-48 


CHKC1A205 
1 


Chicken alpha-2 type-1 
collagen; amino acids - 
16 to 3; Precollagen 
alpha-2 


0.024 
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SEQ 
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P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 






unordered pieces. 










373 


X67016 


H.sapiens mRNA 
for amphiglycan. 


9.00E-49 


CED2085_2 


Caenorhabditis elegans 
cosmid D2085. 
complete sequence; 
D2085;l; Similar to 
glutamine-dependent 
carbamoyl-phosphate 
synthase, aspartate 
carbamoyltransferase, 
dihydroorotase; cDNA 
EST 

cml6f3>GP:CED2085 

2 Caenorhabditis 
elegans cosmid D2085; 
D 


0.14 


374 


LI 0409 


Mouse fork head 
related protein 
(HNF-3beta) 
mRNA, complete 
cds. 


1.50E-49 


MMU04 197J 


Mus musculus HNF3 
beta transcription factor 
(HNF3b) mRNA, 
partial cds; Sequence of 
this partial cDNA 
begins in the first third 
of the conserved 
HNF3/forkhead DNA 
binding domain 


1.20E-30 1 


375 


U01139 


Mus musculus 
B6D2F1 clone 
2C1 IB mRNA. 


1.20E-49 


SPBC3D5J4 


S;pombe chromosome 
II cosmid c3D5; 
Unknown; 
SPBC3D5;14c, 
unknown; partial; 
serine rich, len:309aa, 
similar eg; to 
YNL283C, 
YN23_YEAST, 
P53832, hypothetical 
52;3 kd protein, 
(503aa), 


0.00091 I 


376 


Z82170 


Human DNA 
sequence from 
PAC 326L13 
containing brain- 
4 mRNA ESTs 
and polymorphic 
CA repeat. 


9.00E-50 


BSU55043_3 


Bacillus subtilis 
plasmid pPOD2000 
Flep, RapAB, RapA, 
ParA, ParB, and ParC 
genes, complete cds; 
ORF3 


0.025 I 
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377 


j Z99289 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
142L7;HTGS 
phase 1. 


7.70E-50 


A64431 


hypothetical protein 
MJ1050- 
Methanococcus 
jannaschii>GP:MJU67 
548 2 Methanococcus 
jannaschii from bases 
986219 to 996377 
(section 90 of 1 50) of 
the complete genome; 
M; jannaschii predicted 
coding region MJ1050; 
Identified by 
Gene Mark; putativ 


5.60E-05 


378 


X98260 


H.sapiens mRNA 
for M-phase 
phosphoprotein, 
mppl 1. 


6.20E-50 


ZRF1 MOUS 
E 


ZUOTIN RELATED 
FACTOR>GP:MMU53 
208 1 Mus musculus 
zuotin related factor 
(ZRFl)mRNA, 
complete cds; Similar 
to DnaJ encoded by 
GenBank Accession 
Number LI 6953 


3.90E-30 


379 


M18981 


Human prolactin 
receptor- 
associated protein 
(PRA) gene, 
complete cds. 


9.00E-52 


SI 06 HUMA 
N 


CALCYCLIN 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 
PROTEIN) (PRA) 
(GROWTH FACTOR- 
INDUCIBLE 
PROTEIN 2A9) (SI 00 
CALCIUM-BINDING 
PROTEIN 
A6)>PIR1:BCHUY 
calcyclin - 

human>GP:HUMCAC 
Y_l Human calcyclin 
gene, complete 
cds>GP:HUMCACYA 

1 Human prolactin 
recept 


8.80E-24 


380 


AB006622 


Homo sapiens 
mRNA for 
K1AA0284 gene, 
partial cds. 


1.60E-53 


S33015 


hypothetical protein - 
human herpesvirus 4 


0.00088 
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P 
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381 


U53225 


Human sorting 
nexin 1 (SNX1) 
mRNA, complete 
cds. 


1.80E-55 


G02522 


sorting nexin 1 - 
human>GP:HSU53225 

1 Human sorting 
nexin 1 (SNX1) 
mRNA, complete cds 


9.20E-50 


382 


Z92844 


Human DNA 
sequence from 
PAC 435C23 on 
chromosome X. 
Contains ESTs. 


6.50E-56 


D 14487_1 


Lentinus edodes 
Le;MFBl mRNA, 
complete cds 


1 


383 


D87450 


Human mRNA 
for K1AA0261 
gene, partial cds. 


4.30E-56 


D87450_l 


Human mRNA for 
K1AA0261 gene, 
partial cds; Similar to 
D;melanogaster parallel 
sister chromatids 
protein 


4.30E-30 


384 


AC002301 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
chromosome 
+ 16pll.2 BAC 
clone CIT987SK- 
A-328A3; HTGS 
phase 2, 1 
ordered pieces. 


9.80E-57 


S62328 


kinesin-like DNA 
binding protein KID - 
human>GP:HUMKID_ 
1 Human mRNA for 
Kid (kinesin-like DNA 
binding protein), 
complete cds 


2.60E-27 


385 


L29766 


Homo sapiens 
epoxide hydrolase 
(EPHX) gene, 
complete cds. 


7.30E-57 


HSBCTCF4_1 


Homo sapiens mRNA 
for hTCF-4 


2.30E-05 


386 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin. 


3.30E-58 


MMU58884J 


Mus musculus SH3- 
containing protein 
SH3P7 mRNA, 
complete cds; similar to 
Human Drebrin; SH3- 
containing protein; 
similar to human 
drebrin 


6.00E-43 


387 


Y 15054 


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


9.50E-59 


RNY15054J 


Rattus norvegicus 
mRNA for 70 kDa 
tumor specific antigen, 
partial; 70 kD tumor- 
specific antigen 


4.70E-45 
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Nea rest 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


388 


AC000406 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 1 1 
overlapping pacs 
pDJ235kl0and 
P DJ239b22; 
HTGS phase 1, 
1 7 unordered 
pieces. 


7.40E-59 


<NONE> 


<NONE> 


<NONE> 


389 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds. 


3.60E-59 


KRHUEA 


keratin, type II 
cytoskeietal - human 
(fragment)>GP:HSKE 
RA_1 Human 
messenger fragment 
encoding cytoskeietal 
keratin (type II); 
mRNA from cultured 
epidermal cells from 
human 

foreskin>GP:HUMKE 
R56KJ Human 56k 
cytoskeietal type II 
keratin mRNA 


7.60E-30 


390 


L29766 


Homo sapiens 
epoxide hydrolase 
(EPHX) gene, 
complete cds. 


2.70E-60 


EGR2 HUMA 
N 


EARLY GROWTH 
RESPONSE PROTEIN 
2 (EGR-2) (KROX-20 
PROTEIN) 

(AT591)>GP:HUMEG 
R2A1 Human early 
growth response 2 
protein (EGR2) 
mRNA, complete 
cds>TF D : TF DP004 8 5 
- Polypeptides entry for 
factor Egr-2 


7.80E-06 


391 


L08758 


Mus musculus 
homeobox protein 
(Hox A 10) gene, 
5' end of cds. 


1.40E-60 


PAALGYGE 
N_l 


P;aeruginosa algY 
gene; Alginate lyase 


0.00031 


392 


129058 


Sequence 3 from 
patent US 
5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2 - 

human>GP:D50645_I 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 


1.50E-32 



237 



BNSDOCID: <WO_9933982A2J_> 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VAI I TIP 


ACCESSION 


DESCRIPTION 


P 

VALUE 












factor-2 




393 


129058 


Sequence 3 from 
patent US 

5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2 - 

human>GP:D50645_l 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 
factor-2 


1.50E-32 


394 


U46067 


Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 


1.90E-62 


CHU46067J 


Capra hircus beta- 
mannosidase mRNA, 
complete cds 


2.70E-39 


395 


U40747 


Mus musculus 
formin binding 
protein 1 1 
mRNA, partial 
cds. 


6.90E-63 


S64713 


formin binding protein 
1 1 - mouse 

(fragment)>GP:MMU4 
074 7_1 Mus musculus 
formin binding protein 
1 1 mRNA, partial cds; 
FBP 11; Formin 
binding protein 1 1 ; 
tandem WWP/WW 
domains separated by 
15 amino acid linker 


3.00E-46 


396 


M36164 


Human 

glyceraldehyde-3- 
phosphate 
dehydrogenase 
mRNA, 3' flank. 


1.10E-63 


BHT1UL_12 


Bovine herpesvirus 
type 1 UL22-35 genes; 
UL26;5>GP:BHU3180 
9_2 Bovine herpesvirus 
1 maturational 
proteinase (UL26) 
gene, complete cds, and 
scaffold protein 
(UL26;5) gene, 
complete cds 


0.003 


397 


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


7.30E-65 


MMU39060J 


Mus musculus 
glucocorticoid receptor 
interacting protein 1 
(GRIP 1) mRNA, 
complete cds; 
Hormone-dependent 
interaction with 
hormone binding 
domains of steroid 
receptors; 


0.0054 
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Nearest 
Neighbor 
(BlastX vs. 
Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












transact i vat ion 




398 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


2.70E-70 


JC4239 


phospholipase A2- 
activating protein - rat 


8.40E-17 


399 


D 12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds. 


1.70E-74 


KIF4_MOUS 
E 


K1NESIN-LIKE 
PROTEIN 
KIF4>PIR2:A54803 
microtubule-associated 
motor KIF4 - 
mouse>GP:MUSKIF4_ 
1 Mouse kif4 mRNA 
for microtubule-based 
motor protein KIF4, 
complete cds; ATP- 
binding site: base980- 
1037, motor domain: 
base732- 1781, alpha- 
helical CO 


1.10E-44 


400 


AF007860 


Xenopus laevis 
xl-Mago mRNA, 
complete cds. 


4.60E-75 


AF0O7862J 


Mus musculus mm- 
Mago mRNA, 
complete cds; Similar 
to urosopnua 
melanogaster Mago 
protein 


6.50E-68 


401 


145565 


Sequence 15 from 
patent US 
5637463. 


2.30E-82 


RNU57391J 


Rattus norvegicus 
FceRI gamma-chain 
interacting protein 
SH2- B (SH2-B) 
mRNA, complete cds; 
Putative FceRI gamma 
ITAM interacting 
protein; SH2 domain- 
containing protein B; 
Method: conceptual 


9.90E-42 
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Neighbor 
(BlastX vs. 
Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


402 


U29156 


Mus musculus 
eps!5R mRNA, 
complete cds. 


1.00E-85 


MMU29156J 


Mus musculus epsl5R 
mRNA, complete cds; 
Involved in signaling 
by the epidermal 
growth factor receptor; 
Method: conceptual 
translation supplied by 
author 


4.90E-62 


403 


U70139 


Mus musculus 
putative CCR4 
protein mRNA, 
partial cds. 


1 .OOE-85 


MMU70139J 


Mus musculus putative 
CCR4 protein mRNA, 
partial cds; Similar to 
yeast transcription 
factor CCR4; 
transcriptional 
readthrough occurs 
with transcription being 
initiated at the IAP and 
continues 


7.20E-66 


404 


U82626 


Rattus norvegicus 
basement 
membrane- 
associated 
chondroitin 
proteoglycan 
Bamacan mRNA, 
complete cds. 


7.60E-96 


RNU82626_1 


Rattus norvegicus 
basement membrane- 
associated chondroitin 
proteoglycan Bamacan 
mRNA, complete cds; 
Chondroitin sulfate 
proteoglycan; CSPG 


8.20E-58 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


405 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


2.00E-35 


<NONE> 


<NONE> 


<NONE> 


406 


AB000516 


Homo sapiens 
mRNA for DSIF 
pi 60, complete 
cds 


0.41 


POLG_TUMVQ 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; VPG 
PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (Nl-A) 


2.9 


407 


Z94753 


Human DNA 
sequence from 
PAC 465G10on 
chromosome X 
contains Menkes 
Disease (ATP7A) 
putative Cu++- 
transporting P- 
type ATPase 
exons 22, 23 and 
STS 


0.004 


<NONE> 


<NONE> 


<NONE> 


408 


AB011123 


Homo sapiens 
mRNA for 
K1AA0551 
protein, partial 
cds 


0 


MI15_CAEEL 


Q23356 

caenorhabditis 

elegans. 

serine/threonine- 
protein kinase mig- 
15(ec 2.7.1.-). 11/98 


2.00E-51 


409 


D17218 


Human HepG2 3' 
region Mbol 
cDNA, clone 
hmd3g02m3 


e-123 


NARG_BACSU 


NITRATE 
REDUCTASE 
ALPHA CHAIN (EC 
1.7.99.4) 


9.9 


410 


M95098 


Bos taurus 
lysozyme gene 
(cow 2), complete 
cds 


11 


HAIR_MOUSE 


HAIRLESS 
PROTEIN 


8.00E-10 
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SEQ 
ID 


> ACCESSION 


1 DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


1 P VALUE 


411 


Z60048 


H. sapiens CpG 
DNA, clone 
1 87a9, reverse 
read 

cpgl87a9.rtla. 


4.00E-54 


HN3BJVIOUSE 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


4.00E-21 j 


412 


Z48975 


P.magnus gene 
for protein urPAB 


0.014 


YPT2CAEEL 


HYPOTHETICAL 
21.6 KD PROTEIN 
F37A4.2 IN 
CHROMOSOME III 


2.00E-12 


413 


AJ001296 


Notophthalmus 
viridescens 
mRNA for 
cytokeratin 8 


0.37 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03 IN 
CHROMOSOME I 


5.00E-21 


414 


J0383 1 


Xenopus laevis 
(clone pXEC1.3) 
C protein mRNA, 
complete cds. 


0.37 


PDR5YEAST 


SUPPRESSOR OF 
TOXICITY OF 
SPORIDESMIN 


3.3 


415 


AB007157 


Homo sapiens 
gene for 

ribosomal protein 
S21, partial cds 


e-142 


RS21_HUMAN 


40S RIBOSOMAL 
PROTEIN S21 


0.002 


416 


X86340 


H. sapiens C7 
gene, exon 13 


3.3 


STCDROME 


SHUTTLE CRAFT 
PROTEIN 


4.3 


417 


U 12404 


Human Csa-19 
mRNA, complete 
cds. 


0 


R10A__PIG 


60S RIBOSOMAL 
PROTEIN LI OA 
(CSA-19) 
(FRAGMENT) 


9.00E-57 


418 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 1 


419 


M80198 

< 


Human FKBP-12 
pseudogene, clone 
lambda-512, 5* 
flank and 
complete cds. 


5.00E-14 


RCO!_NEUCR 


TRANSCRIPTIONA 
L REPRESSOR 
RCO-1 j 


0.008 


420 


AF052573 ] 
] 
( 
i 
< 


Homo sapiens 
DNA polymerase 

»ta f POT T-H 

nRNA, complete 
:ds 


0 


<NONE> 


<NONE> j 


<NONE> 


421 


AF035940 1 

r 

c 


-lomo sapiens 
V1AGOH mRNA, 
:omplete cds 


e-131 


MGNDROME I 
I 


VIAGO NASHI i 
PROTEIN 


4.00E-39 


422 


AF054994 I 
c 
r 


-lomo sapiens 
ilone 23832 
nRNA sequence 


0.12 | 


<NONE> 


<NONE> 


<NONE> 
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ACCESSION 


DESCRIPTION 
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423 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


424 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


425 


D43952 


Mouse gene for 
reticulocalbin, 
exonl and 
promoter region 


0.36 


<NONE> 


<NONE> 


<NONE> 


426 


X68553 


C.elegans 
repetitive DNA 
sequence 


0.4 


TCB1_RABIT 


T-CELL 

RECEPTOR BETA 
CHAIN 

PRECURSOR (ANA 
11) 


0.11 


427 


M83314 


Tomato 
phenylalanine 
ammonia lyase 
(pal) gene, 
complete cds and 
promoter region. 


3.3 


SMB2_HUMAN 


DNA-BINDING 
PROTEIN SMUBP-2 
(GLIAL FACTOR- 1) 
(GF-1) 


0.65 


428 


AF070636 


Homo sapiens 
clone 24686 
mRNA sequence 


5.00E-23 


<NONE> 


<NONE> 


<NONE> 


429 


<NONE> 


<NONE> 


<NONE> 


IQGAHUMAN 


RAS GTPASE- 
ACTIVATING- 
LIKE PROTEIN 
IQGAP1 (P195) 


2.00E-06 


430 


AF068627 


Mus muscuius 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


LOXl_LENCU 


LIPOXYGENASE 
(EC 1.13.11.12) 


9.9 


431 


AF020043 


Homo sapiens 
chromosome- 
associated 
polypeptide 


0 


YJH4YEAST 


HYPOTHETICAL 
141.3 KD PROTEIN 
IN SCP160-MRPL8 
INTERGENIC 
REGION 


4.00E-16 


432 


K00046 


ross river virus 
26s subgenomic 
rna and junction 
region. 


0.12 


CUL2J-IUMAN 


CULLIN 
HOMOLOG 2 
(CUL-2) 


7.4 
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SEQ 
ID 
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! DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


433 


AF005664 


Homo sapiens 
properdin (PFC) 
gene, complete 
cds 


0.005 


UL88__HCMVA 


PROTEIN UL88 


5.8 


434 


Z70705 


H.sapiens mRNA 
(fetal brain cDNA 
com5) 


2.00E-05 


PH87_YEAST 


INORGANIC 
PHOSPHATE 
TRANSPORTER 
PH087 


1.5 


435 


U29156 


Mus musculus 
epsl5RmRNA, 
complete cds. 


e-125 


! EP15_HUMAN 


EPIDERMAL 
GROWTH FACTOR 
RECEPTOR 
SUBSTRATE 
SUBSTRATE 15 
(PROTEIN EPS 15) 
(AF- IP PROTEIN) 


1.00E-13 


436 


AE000750 


Aquifex aeolicus 
section 82 of 109 
of the complete 
genome 


0.37 


<NONE> 


<NONE> 


<NONE> 


437 


U49169 


Dictyostelium 
discoideum V- 
ATPase A subunit 
(vatA) mRNA, 
complete cds 


0.12 


VCAP_HSV6U 


MAJOR CAPSID 
PROTEIN (MCP) 


5.6 


438 




Homo sapiens 
uncoupling 
protein 3 (UCP3) 
gene, exon 1 and 
paniai exon z 


0.13 


WEEl_SCHPO 


MITOSIS 
INHIBITOR 
PROTEIN KINASE 
WEE1 (EC 2.7.1.-) 


3.7 


439 


AB000425 


Porcine DNA for 
endopeptidase 
24.16, exon 16 
djju complete cus 


4.00E-32 


<NONE> 


<NONE> 


<NONE> 


440 


U51037 


Mus musculus 11- 
zinc-finger 
transcription 
factor 


0.04 


<NONE> 


<NONE> 


<NONE> 


441 


( 
< 


Homo sapiens 
ubiquitin 
conjugating 
enzyme G2 


e- 1 1 0 


<NONE> 


<NONE> 


<NONE> 


442 


AF009288 1 
< 
< 
i 


-lomo sapiens 
:lone HEB8 Cri- 
iu-chat region 
tiRNA 


2.00E-14 


LMG INHUMAN I 
< 
] 
( 
< 


LAMININ 

3amma-1 chain 
precursor 
laminin b2 
:hain) 


8.1 
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ID 

~443~ 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


AF024578 


Homo sapiens 
type-1 protein 
phosphatase 
skeletal muscle 
glycogen 
tareetine subunit 
(PPPlR3)gene, 
exon 4, and 
complete cds 


1.1 


<NONE> 


<NONE> 


<NONE> 


444 


M24486 


Human prolyl 4- 
hydroxylase alpha 
subunit mRNA, 
complete cds, 
clone PA-11. 


0 


DACHA 


<NONE> 


4.00E-58 


445 


X96400 


P.tetraurelia j 
alpha-51D gene 


0.37 


<NONE> 


<NONE> 


<NONE> 


446 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


/• /in 
447 


X84996 


X.laevis mRNA 
for selenocysteine 
tRNA acting 
factor (Staf) 


0.12 


POL_MLVRD j 


POL 

POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASEH 
(EC 3.1.26.4)) 


2.00E-08 


i t £ * o 


AFO 19980 


Dictyostelium 
discoideum ZipA 
(zipA) gene, 
partial cds 


3.4 


HMDLBRAh L 


HUM dud D A 
PROTEIN DLL 
HOMOLOG 




449 


X78424 


D.carota (Queen 
Anne's Lace) 
Inv*Dc2 gene, 
3432bp 


0.38 


<N(JNL> 






450 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


451 


1 X89886 


P.patens mRNA 
for 5- 

aminolevulinate 
dehydratase 


1.1 


CKR6HUMAN 


C-C CHEMOKINE 
RECEPTOR TYPE 6 
(C-C CKR-6) 
(CCR6) 


9.9 


452 


U67471 


Methanococcus 
jannaschii section 
13 of 150 of the 
complete genome 


0.12 


YR72_ECOLI 


HYPOTHETICAL 
53.2 KD PROTEIN 
(ORF2) (RETRON 
EC67) 


5.8 


453 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein 
106 (Zfpl06) 
mRNA, H3a-a 
allele, complete 
cds 


1.00E-62 


YOJ8_CAEEL 


HYPOTHETICAL 
51.6 KD PROTEIN 
ZK353.8 IN 
CHROMOSOME III 


1.7 
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SEQ 
ID 

°454 


ACCESSIOh 


I DESCRIPTION 


P VALUE 


ACCESSION 


; DESCRIPTION 


P VALUE 


U70667 


Human Fas-iigan< 
associated factor 
1 mRNA, partial 
cds 


i 0 


YKB2YEAST 


HYPOTHETICAL 
69.1 KD PROTEIN 
IN PUT3-CCE1 
INTERGENIC 
REGION 


3.00E-09 


455 


M95858 


Bos taurus 
recover in mRNA 
complete cds. 


0.35 


GIDAMYCGE 


GLUCOSE 

TX TUT DTTDn 

IJNHiol 1 bD 
DIVISION 
PROTEIN A 


1.4 


456 


U67594 


Methanococcus 
iannaschii section 

JUIIIIUOVJJII Jvvllvll 

136 of 150 of the 
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


457 


X06747 


Human hnRNP 
core protein Al 


3.00E-31 


<NONE> 


<NONE> 


<NONE> 


458 


Z65575 


H.sapiens CpG 
DNA, clone 47c5, 
reverse read 
cpg47c5.rtla . 


1.3 


| <NONE> 


<NONE> 


<NONE> 


459 


X88893 


C.jacchus intron 4 
of visual pigment 
gene 


5.00E-15 


<NONE> 


<NONE> 


<NONE> 


460 


M57426 


Maize stripe virus 
RNA3 
nonstructural 
protein 


0.33 


DSC2_MOUSE 


DESMOCOLLIN 
2A/2B 

PRECURSOR 
(EPITHELIAL 
TYPE 2 

DESMOCOLLIN) 


6.5 


461 


X01638 


Yeast TEF1 gene 
for elongation 
factor EF-1 alpha 


1.1 


PPOL_DROME 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


3.5 


462 


M60064 


S.typhimurium 
glutamate 1- 
semialdehvde 
aminotransferase 
(hemL) gene, 
complete cds. 


1.1 


EPB4_MOUSE 

] 


EPHRIN TYPE-B 
RECEPTOR 4 
rlvbCUKaOR (EC 
2.7.1.1 12) KINASE 
2) (TYROSINE 
KINASE MYK- 1) 


2.5 I 


463 


X51508 ] 

4 


Rabbit mRNA for 
aminopeptidase N 
partial) 


0.36 


ACHGXENLA , 
3 
1 
( 
I 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
ZHAIN 
PRECURSOR 


1.5 


464 


L10106 

I 

F 

r 

c 


VIus musculus 
Drotein tyrosine 
)hosphate 
nRNA, complete 
:ds. 


2.00E-58 


VG13 BPML5 ( 


3ENE 13 PROTEIN 
GP13) 


2.5 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


465 


M77235 . 


Human cardiac 
tetrodotoxin- 
insensitive 
voltage-dependent 
sodium channel 
alnha subunit 
(HHl)mRNA, 
complete cds. 


3.8 


ZPBOC1 


<NONE> 


6.9 


466 


M58330 


C.maltosa 
autonomously 

rpnliratinp 
sequence. 


0.004 


EPB4JvlOUSE 


EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (FC 
2.7.1.112) KINASE 
2) (TYROSINE 
KINASE MYK- 1) 


2.4 


467 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.35 


ACHGXENLA 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 


2.4 


468 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


7.00E-59 


VGLI_PRVRI 


GLYCOPROTEIN 
GP63 PRECURSOR 


4.3 


469 


U65939 


Azotobacter 
vinelandii GTPase 
(ftsA) gene, 
partial cds, and 
ATP binding 
protein (ftsZ) 
gene, complete 
cds 


1.1 j 


TRUA_BACSP 


Q45557 bacillus sp. 
(strain ksm-64). trna 
pseudouridine 
synthase a (ec 
4.2.1.70) 
(pseudouridylate 
synthase i) 
(pseudouridine 

*>y 1 1 ii laoc i j ^uncivil 

hydroiyase). 1 1/98 


0.001 


470 


U51037 


Mus musculus 1 1- 
zinc-finger 
transcription 
factor 


0.041 


<NONE> 


<NONE> 


<NONE> 


471 


M32685 


Human platelet 
glycoprotein Ilia, 
exon 14. 


3.6 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


472 


U82691 


Phrynocephalus 
raddei CAS 
179770 NADH 
dehydrogenase 
subunit 1 (ND1), 
partial cds, tRNA- 
Gln, tRNA-IIe 
and tRNA-Met, 
NADH 

dehydrogenase 
subunit 2 tRNA- 
Cys and tRNA- 
Tyr and c... 


1.1 


<NONE> 


<NONE> 


<NONE> 


473 


D85430 


Mouse Murrl 
mRNA, exon 


0.12 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC 
2.7.1.112) 


2.5 


474 


U20661 


Dictyostelium 
discoideum 
unknown internal 
repeat protein 
gene, complete 
cds, and unknown 
orfl, orf2 and 
orf3 genes, partial 
cds 


0.36 


YHL1JEBV 


HYPOTHETICAL 
BHLF1 PROTEIN 


4.00E-04 


475 


X56537 


Human novel 
homeobox mRNA 
for a DNA 
binding protein 


0.04 


FA5_HUMAN 


COAGULATION 
FACTOR V 
PRECURSOR 
(ACTIVATED 
PROTEIN C 
COFACTOR) 


9.5 


476 


U32843 


Haemophilus 
influenzae Rd 
section 158 of 163 
of the complete 
genome 


5 


<NONE> 


<NONE> 


<NONE> 


477 


U67554 

< 


Vlethanococcus 
annaschii section 
96 of 150 of the 
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


478 


AB004244 ] 
i 

< 

( 


Narke japonica 
mRNA for Nj- 
synaphin lb, 
;omplete cds 


1.1 


NIA1 ORYSA 1 

1 


NITRATE 

REDUCTASE 1 (EC 
1 .6.6. 1)(NR1) 


1.00E-07 


479 


AF075079 J 
1 
c 


-iomo sapiens full 
ength insert 
:DNA YQ80A08 


1.00E-12 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


480 


AE000723 


Aquifex aeolicus 
section 55 of 109 
of the complete 
genome 


1 


YKK0_YEAST 


HYPOTHETICAL 
67.5 KD PROTEIN 
IN APE1/LAP4- 
CWP1 

INTERGENIC 
REGION 


9.1 


AO 1 

481 


X73902 


H.sapiens mRNA 
for nicein B2 
chain 


0 


LMG2HUMAN 


LAMININ 
GAMMA-2 CHAIN 
PRECURSOR 


3.00E-93 


482 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-10 


P53_CRIGR 


CELLULAR 
TUMOR ANTIGEN 
P53 


5.7 


483 


ALO 10240 


Plasmodium 
falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-64, complete 
sequence 


1.2 


<NONE> 


<NONE> 


<NONE> 


484 


U49919 


Arabidopsis 
thai tana lupeol 
synthase mRNA, 
complete cds 


0.54 


YA53_SCHPO 


HYPOTHETICAL 
24. Z ivD rKUl ill IN 
C13A11.03 IN 
CHROMOSOME I 


6.00E-10 


485 


AF0776I8 


Homo sapiens 
p73 gene, exon 3 


0.39 


MYODJvlOUSE 


MYOBLAST 
DETERMINATION 

rKU 1 HI IN I 


2.1 


A OZT 

486 


AF054994 


Homo sapiens 
clone 23832 
mRNA sequence 


0.13 


<NONE> 


<NONE> 


<NONE> 


487 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


488 


AF068627 


Mus musculus 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


ACE2_YEAST 


METALLOTHIONE 
IN EXPRESSION 
ACTIVATOR 


1.5 


489 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-07 


R1NI_P1G 


RIBONUCLEASE 
INHIBITOR 


0.19 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


490 


L77886 


Human protein 
tyrosine 
phosphatase 
mRNA, complete 
cds 


1.00E-21 


VS48_TBRVS 


SATELLITE RNA 
48 KD PROTEIN 


1.6 


491 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


5.00E-04 


CRP3_LIMPO 


C-REACTIVE 
PROTEIN 3.3 
PRECURSOR 


3.5 


492 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC 
2.7.1.112) 


2.7 


493 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-09 


<NONE> 


<NONE> 


<NONE> 


494 


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.37 


<NONE> 


<NONE> 


<NONE> 


495 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.37 


NCPRYEAST 


NADPH- 
CYTOCHROME 
P450 REDUCTASE 
(EC 1.6.2.4) (CPR) 


7.0OE-05 


496 


U95I02 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.013 


YMB3CAEEL 


PROBABLE 
INTEGRIN ALPHA 
CHAIN F54G8.3 
PRECURSOR 


3.3 


497 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


498 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-10 


<NONE> 


<NONE> 


<NONE> 


499 


U95102 

< 

< 


Xenopus laevis 
mitotic 

phosphoprotein 
?0 mRNA, 
complete cds 


2.00E-07 


VGLY LYCVW 

1 

< 
( 


GLYCOPROTEIN 

POLYPROTEIN 

PRECURSOR 

CONTAINS: 

GLYCOPROTEINS 

31 AND G2) 


3.2 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DFSCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


500 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-06 


HR78DROME 


NUCLEAR 

HORMONE 

RECEPTOR HR78 

(DHR78) 

(NUCLEAR 

RECEPTOR 

XR78E/F) 


2.5 


501 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


MYSH_BOVIN 


MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


4.00E-04 


502 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-04 


BAL_HUMAN 


BILE-SALT- 
ACTIVATED 
LIPASE 

PRECURSOR (EC 
3.1.1.3) (EC 
3.1.1.13) (BAL) 
(BILE-SALT- 
STIMULATED 
LIPASE) (BSSL) 
ESTERASE) 
(PANCREATIC 
LYSOPHOSPHOLIP 
ASE) 


2.6 


CAT 


AF080399 


Drosophila 
melanogaster 
mitotic 
checkpoint 
control protein 
kinase BUB1 
(Bub 1) mRNA, 
complete cds 


1.1 


NAT 1 YEA ST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


2.00E-23 




U59706 


Gallus gallus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


505 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 


506 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


I DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


507 


AF 100661 


Caenorhabditis 
elegans cosmid 
H20E11 


0.38 


<NONE> 


<NONE> 


<NONE> 


508 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-11 


CA1AHUMAN 


COLLAGEN 
ALPHA 1(X) 
CHAIN 
PRECURSOR 


0.024 


509 


U47322 


Cloning vector 
DNA, complete 
sequence. 


2.00E-38 


COA1_SV40 


COAT PROTEIN 
VP1 


6.2 


510 


At 03 1924 


Homo sapiens 
homeobox 
transcription 
factor barx2 


e-156 


CCMAHAEIN 


HEME EXPORTER 
PROTEIN A 
(CYTOCHROME C- 
TYPE BIOGENESIS 
ATP-BINDING 
PROTEIN CCMA) 


3.5 


511 


AFO 10484 


Homo sapiens ICI 
YAC9IA12, right 
end sequence 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


512 


Z63829 


H. sapiens CpG 
DNA, clone 90h2, 
forward read 
cpg90h2.ftla. 


5.00E-22 


NFIRMESAU 


NUCLEAR 
FACTOR 1 CLONE 
PNF1/RED1 (NF>I) 
(CCAAT-BOX 
BINDING 
TRANSCRIPTION 
FACTOR) (CTF) 
(TGGCA-BINDING 
PROTEIN) 


2.4 


513 


Z35094 


H.sapiens mRNA 
for SURF-2 


5.00E-97 


SUR2_HUMAN 


SURFEIT LOCUS 
PROTEIN 2 


1 .00E-46 


514 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
Luinpjete cus 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 


515 


D38417 

i 
( 


Mouse mRNA for 
arylhydrocarbon 
receptor, 
complete cds 


e-154 


TEGUEBV 

] 


LARGE 

TEGUMENT 

PROTEIN 


3.4 


516 


L10911 ] 

< 

< 


4omo sapiens 
jplicing factor 
CC1.4) mRNA, 
:omplete cds. 


e-117 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 






P V A I I TP 

r V ALUC 


AL^-Cj jl WIN 


OF SCRIPT] ON 


P VAI 1 IF 


517 


XI 7093 


Human HLA-F 
gene for human 
leukocyte antigen 
F 


0.009 


YENl_SCHPO 


013695 

sch i zosacc haromy ces 
pombe (fission 
yeast), hypothetical 
52.9 kd serine-rich 
protein cl lg7.01 in 
chromosome i. 1 1/98 


5.4 


518 


ABO 17026 


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYBHUMAN 


OXYSTEROL- 
BINDING PROTEIN 


1 .00E-40 


519 


X55038 


Mouse mCENP-B 
gene for 
centromere 
autoantigen B 


0.001 


YNW7_YEAST 


HYPOTHETICAL 
68.8 KD PROTEIN 
IN URE2-SSU72 
INTERGENIC 
REGION 


3.00E-04 


520 


ABO 18323 


Homo sapiens 
mRNA for 
K1AA0780 
protein, partial 
cds 


3.00E-41 


LBRCHICK 


LAMIN B 
RECEPTOR 


2.3 


521 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-10 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 
PRECURSOR 


0.002 


522 


X03558 


Human mRNA 
for elongation 
factor 1 alpha 
subunit 


0 


EF1 1_HUMAN 


ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-1 -ALPHA- 1) 


e-110 


523 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-11 


YMT8YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP1 16-FAR3 
INTERGENIC 
REGION 


8.00E-07 


524 


AB014591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


0 


NOT2YEAST 


GENERAL 
NEGATIVE 
REGULATOR OF 
TRANSCRIPTION 
SUBUNIT 2 


8.00E-05 


525 


AB019488 


Homo sapiens 
DNA for TRKA, 
exon 1 7 and 
complete cds 


0 


TRKAHUMAN 


HIGH AFFINITY 

NERVE GROWTH 

FACTOR 

RECEPTOR 

PRECURSOR 

PROTEIN) (PI 40- 

TRKA) 


2.00E-27 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


526 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-15 


CNG4_BOVIN 


240K PROTEIN OF 
ROD 

PHOTORECEPTOR 
CNG-CHANNEL 
CYCLIC- 
NUCLEOTIDE- 
GATED CATION 
CHANNEL 4 (CNG 
CHANNEL 4) 
MODULATORY 
SUBUNIT)) 


0.018 


527 


U95094 


Xenopus laevis 

A L> - 1 iN \^ o IN r 

(XL-INCENP) 
mRNA, complete 
cds 


2.00E-06 


HMZ1 DROME 


ZERKNUELLT 
PROIEIN 1 (ZEN-1) 


0.88 


528 


J03750 


Mouse single 
stranded DNA 
binding protein p9 
mRNA, complete 

cds. 


e-135 


P15JHUMAN 


ACTIVATED RNA 
POLYMERASE II 
TRANSCRIPTIONA 
L COACTIVATOR 
P15(PC4) (PI 4) 


3.00E-21 


529 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-12 


RS5_DROME 


40S RIBOSOMAL 
PROTEIN S5 


0.42 


530 


Z57610 


H.sapiens CpG 
DNA, clone 
187a 10, reverse 
read 

cpg!87al0.rtla . 


8.00E-61 


HN3BJVIOUSE 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


4.00E-15 


531 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 


3.00E-60 


<NONE> 


<NONE> 


<NONE> 


532 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA romnlpfp 
cds 


4.00E-11 


<NONE> 


<NONE> 


<NONE> 


533 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU INHUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.1 


534 


X92841 


H.sapiens MICA 
gene 


1 .00E-55 


LIN1HUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-09 
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Nearest Neighbor (BlastX vs. Non-Redundf 


int Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


535 


U60337 


Homo sapiens 
t>eta-mannosidase 
mRNA, complete 
cds 


0 


NODC_BRAEL 


N- 

ACETYLGLUCOSA 
MINYLTRANSFER 
ASE (EC 2.4.1.-) 


1.4 


536 


M21731 


Human lipocortin- 
V mRNA, 
complete cds. 


e-169 


ANX5_HUMAN 


ANNEXIN V 
(LIPOCORTIN V) 
(ENDONEXIN II) 
(CALPHOBINDIN 
I) (CBP-I) 
(PLACENTAL 
ANTICOAGULANT 
PROTEIN I) (PAP-l) 

•ALPHA) (VAC- 
ALPHA) 

(ANCHORIN CI I) 


1.00E-05 


537 


Y08013 


S.salar DNA 
segment 
containing GT 
repeat 


0.006 


<NONE> 


<NONE> 


<NONE> 


538 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


539 


M98502 


Mus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 


2.00E-17 


DYNACHICK 


DYNACTIN, 117 
KD ISOFORM 


7.4 


540 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


HXA3_HAEIN 


HEME:HEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.6 


541 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-13 


AMOJCLEAE 


AMINE OXIDASE 
PRECURSOR (EC 
1.4.3.6) 

(MONAMlNh 
OXIDASE) 
(TYRAMINE 
OXIDASE) 


1.5 


542 


AF083322 


Homo sapiens 
ceninoie 

associated protein 
CEPllOmRNA, 
complete cds 


e-133 


CA34J4UMAN 


PROCOLLAGEN 
ALPHA 3(IV) 
CHAIN 
PRECURSOR 


1.5 


543 


J03746 


Human 
glutathione S- 
transferase 
mRNA, complete 
cds. 


e-170 


GTMLHUMAN 


GLUTATHIONE S- 
TRANSFERASE, 
MICROSOMAL (EC 
2.5.1.18) 


5.00E-39 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


544 


U67522 


J vicil lal lULUtvUj 

jannaschii section 
64 of 150 of the 
complete genome 


U.-> / 


A IAA_nUMAIN 


A T OU A 1 a 

ALPHA- 1 A 
ADRFNFRHTP 

RECEPTOR 


4.3 


545 


U95102 


Xenopus laevis 
mitotic 

pnospnoproxein 
90 mRNA, 
complete cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


546 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


547 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


548 


D87001 


Human (lambda) 
DNA for 
immunoglobulin 
light chain 


0.35 


VAL3_TYLCU 


AL3 PROTEIN (C3 
PROTEIN) 


3.2 


549 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-08 


TEGU_HSV11 


LARGE 
TEGUMENT 
PROTEIN (VIRION 
PROTEIN UL36) 


0.004 


550 


D16991 


Human HepG2 
partial cDNA, 
clone 

hmd2d01m5 


8.00E-09 


PTM1_YEAST 


PROTEIN PTM1 
PRECURSOR 


0.033 


551 


M34025 


Human fetal Ig 
heavy chain 
variable region 


3.2 


<NONE> 


<NONE> 


<NONE> 


552 


M98502 


Mus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 


5.00E-14 


<NONE> 


<NONE> 


<NONE> 


553 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


554 


£-. / O / JU 


H. sapiens flow- 
sorted 

chromosome 6 
Hindi 1 1 fragment, 
SC6pA15C3 


3.UUh-2U 


AT TT1 I IT U < A XI 

ALU 1_HUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


5.00E-06 


555 


U74496 


Human 

chromosome 4q35 

subtelomeric 

sequence 


8.00E-08 


ICP4J/ZVD 

] 


TRANS-ACTING 
TRANSCRIPTIONA 
L PROTEIN ICP4 


0.39 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


556 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 


2.00E-56 


YHFK_ECOLI 


HYPOTHETICAL 
79.5 KD PROTEIN 
IN CRP-ARGD 
INTERGENIC 
REGION (0696) 


9.8 


557 


U65416 


Human MHC 
class I molecule 
(MICB) gene, 
complete cds 


0.12 


<NONE> 


<NONE> 


<NONE> 


558 


AG00O037 


Homo sapiens 
genomic DNA, 
21q region, clone: 
9H11A22 


5.00E-25 


<NONE> 


<NONE> 


<NONE> 


559 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-05 


<NONE> 


<NONE> 


<NONE> 


560 


AB007918 


Homo sapiens 
mRNA for 
KIAA0449 
protein, partial 
cds 


0.015 


VGLE_HSV11 


GLYCOPROTEIN E 
PRECURSOR 


2.2 


561 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin 


1.00E-73 


YCV2_YEAST 


HYPOTHETICAL 
13.8 KD PROTEIN 
IN PWP2-SUP61 
INTERGENIC 
REGION 


2.6 


562 


AB007878 


Homo sapiens 
KIAA0418 
mRNA, complete 
cds S 


e-110 


GLU2JV1AIZE 


GLUTELIN 2 
PRECURSOR 
(ZEIN-GAMMA) 
(27 KD ZEIN) 


0.72 


563 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


YJD6_YEAST 


HYPOTHETICAL 
49.0 KD PROTEIN 
IN NSP1-KAR2 
INTERGENIC 
REGION 


1.4 


564 


U27873 


Stealth virus 1 
clone 3B 11 T7 


0.002 


SYN INHUMAN 


SYNAPSINS IA 
AND IB (BRAIN 
PROTEIN 4.1) 


1.6 




L38951 


Homo sapiens 
importin beta 
subunit mRNA, 
complete cds 


2.00E-68 


VP2_BRD 


STRUCTURAL 
CORE PROTEIN 
VP2 


1.1 


566 


AF007155 


Homo sapiens 
clone 23763 
unknown mRNA, 
partial cds 


e-165 


YOHI_AZOVl 


HYPOTHETICAL 
33.2 KD PROTEIN 
IN IBPB 5'REGION 


7.5 
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Table 2 





; Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 

= 567 






r VALUc 


ACCESSION 


DESCRIPTION 


P VALUE 


Z56295 


H.sapiens CpG 
DNA, clone 10c2, 
forward read 
cpgl0c2.ftla . 


0.12 


Al ABCANFA 


ALPHA- IB 
ADRENERGIC 
RECEPTOR 
(FRAGMENT) 


0.85 


568 


Z83792 


G.gallus 
microsatellite 
DNA (LEI0222 


0.12 


<NONE> 


<NONE> 


<NONE> 


569 


U11820 


Feline 

immunodeflcienc 
y virus 

USIL2489__7B 
gag polyprotein 
(gag) gene, 
complete cds, 
polymerase 
polyprotein (pol) 
gene, partial cds, 
vif protein (vif), 
complete cds, and 
envelope 
glycoprotein 
(env), complete 
cds, complete g... 


1.1 


<NONE> 


<NONE> 


<NONE> 


570 


Ml 8065 


Mouse 1 8S and 
28S ribosomal 
DNA, 5' 
hypervariable 
(Vr) region, clone 
Ml. 


6.00E-04 


CC40_YEAST 


CELL DIVISION 
CONTROL 
PROTEIN 40 


3.7 


571 


AF053645 


Homo sapiens 
cellular apoptosis 
susceptibility 
protein (CSE1) 
gene, exons 3 
through 10 


2.00E-07 


YMQ4_CAEEL 


HYPOTHETICAL 
25.8 KD PROTEIN 
K02D10.4 IN 
CHROMOSOME III 


4.3 


572 


AUH JOO 


Human z.d kd 
mRNA for 
cytoskeietal 
tropomyosin 
i jviju^nm ) 


0 


<NONE> 


<NONE> 


<NONE> 


573 


AC001159 

] 


Homo sapiens 
[subclone 1 h9 
from PAC H92) 
DNA sequence 


5.00E-04 


XYNDCELFI j 
1 


ENDO- 1 ,4-BETA- 
K YLANASE D 
PRECURSOR (EC 
5.2.1.8) 


7.3 


574 


Z60625 1 
1 
i 
c 


^.sapiens CpG 
DNA, clone 2c 10, 
brward read 
;pg2cl0.ftlaa . 


4.00E-13 


<NONE> 


<NONE> 


<NONE> 


575 


AF070640 I 
c 


-lomo sapiens 
;lone 24781 


e-164 


<NONE> 


<NONE> 
. 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






mRNA sequence 










576 


Y11306 


Homo sapiens 
mRNA for hTCF- 
4 


2.00E-48 


TCF1 HUMAN 


T-CELL-SPECIFJC 
TRANSCRIPTION 
FACTOR 1 (TCF-1) 


2.00E-15 


577 


X65279 


pWE15 cosmid 
vector DNA 


7.00E-69 


OCLN_POTTR 


Q28793 potorous 
tridactylus (potoroo). 
occludin. 11/98 


0.71 


578 




Mouse DNA with 
homology to EBV 
IR3 repeat, 
segment 1, clone 

IviUZ. 


n An i 


LIVlol n I UAI 


CHAIN 

PRECURSOR 

(FRAGMENTS) 


1 0 


579 


X53744 


Canine mRNA for 
68kDA subunit of 
signal recognition 
particle (SRP68) 


e-162 


SR68_CANFA 


SIGNAL 
RECOGNITION 
PARTICLE 68 KD 
PROTEIN (SRP68) 


5.00E-16 


580 


AF086438 


Homo sapiens full 
length insert 
cDNA clone 
ZD80G11 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


581 


U15140 


Mycobacterium 
bovis ribosomal 
proteins IF- 1 
complete cds, and 
S4 (rpsD) gene, 
partial cds 


1.3 


<NONE> 


<NONE> 


<NONE> 


582 


D 13292 


Human mRNA 
for ryudocan core 
protein 


e-166 


RSP4ARATH 


40S RIBOSOMAL 
PROTEIN SA (P40) 
(LAMININ 
RECEPTOR 
HOMOLOG) 


1.4 


583 


S71022 


neoplasm-related 
CI 40 product 
[human, thyroid 
carcinoma cells, 
mRNA, 670 nt] 


9.00E-30 


RL6HUMAN 


60S RIBOSOMAL 
PROTEIN L6 (TAX- 
RESPONSIVE 
ENHANCER 
ELEMENT 
BINDING PROTEIN 
107) (TAXREB107) 


5.6 


584 


L20934 


Anopheles 
gambiae complete 
mitochondrial 
genome 


0.014 


<NONE> 


<NONE> 


<NONE> 


585 


Z49269 


H. sapiens gene 
for chemokine 
HCC-1. 


1.1 


AMY1DICTH 


ALPHA-AMYLASE 
1 (EC 3.2.1. 1)(1,4- 
ALPHA-D- 
GLUCAN 

GLUCANOHYDRO 
LASE) 


2.5 
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Nearest N< 


2ighbor (BiastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)] 


SEQ 
ID 


ACCESSION 


I DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


586 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


587 


AF029893 


Homo sapiens i- 

beta-l,3-N- 

acetylglucosamin 

yltransferase 

mRNA, complete 

cds 


0.13 


HEMO_PIG 


HEMOPEXIN 
PRECURSOR 
(HYALURONIDAS 
E) (EC 3.2.1.35) 


3.5 


588 


J05109 


T.thermophila 
calcium-binding 
25 kDa (TCBP 
25) protein gene, 
complete cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


589 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


<NONE> 


<NONE> 


<NONE> 


590 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein 
106 (Zfpl06) 
mRNA, H3a-a 
allele, complete 
cds 


l.OOE-83 


SCRBPEDPE 


SUCROSE-6- 
PHOSPHATE 
HYDROLASE (EC 
3.2.1.26) 
(SUCRASE) 


10 


591 


Y11966 


B.aphidicola (host 
T.suberi) piasmid 
pBTsl genes 
leuA, hspA, 
repA2, repAl, 
leuB, leuC, leuD, 
leuA 


0.37 


<NONE> 


<NONE> 


<NONE> 


592 


U20428 


Human SNC19 
mRNA sequence 


1.00E-64 


YY22_MYCTU 


HYPOTHETICAL 
30.8 KD PROTEIN 
CY49.22 


0.29 


593 


AF043084 

i 
( 


Lycopersicon 
esculentum 
ethylene receptor 
lomoloe CETRl^ 
mRNA, complete 
:ds 


0.37 


KNIRDROME , 


ZYGOTIC GAP 
PROTEIN KNIRPS 


9.9 


594 


X65279 


5WE15 cosmid 
/ector DNA 


5.00E-66 


COA1 SV40 < 


"OAT PROTEIN 
VP\ 


0.001 


595 


U95098 : 
i 
f 

£ 


Kenopus laevis 
nitotic 

phosphoprotein 
14 mRNA, partial 


0.041 


UL88_HSV7J 1 


PROTEIN U59 


5.8 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






cds 










596 


M91452 


Sus scrofa 
ryanodine 
receptor (RYR1) 
gene, complete 
cds. 


3.2 


<NONE> 


<NONE> 


<NONE> 


597 


U77327 


Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 


e-158 


GAT1_CHICK 


ERYTHROID 
TRANSCRIPTION 
FACTOR (GATA-1) 
(ERYF1) 


1.2 




U77327 


Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 


0 


RPB7_ARATH 


DNA-DIRECTED 
RNA 

POLYMERASE II 
19 KD 

POLYPEPTIDE (EC 
2.7.7.6) (RNA 
POLYMERASE II 
SUBUNIT 5) 


6.2 


599 


Y 16964 


Saccharomyces 
sp. mitochondrial 
DNA for OLI1 
gene, strain CID1 


0.37 


NMD5_YEAST 


NONSENSE- 
MEDIATED MRNA 
DECAY PROTEIN 5 


1.9 


600 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


<NONE> 


<NONE> 


<NONE> 


601 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


602 


AF091046 


Brugia pahangi 
nuclear hormone 
receptor (bhr-1) 
gene, partial cds 


1.1 


INVOJPONPY 


INVOLUCRIN 


0.23 


Am 

OKJJ 


M87339 


Human 

replication factor 
C, 37-kDa subunit 
mRNA, complete 
cds 


0 


AC12_HUMAN 


ACTIVATOR 1 37 
KD SUBUNIT 
(REPLICATION 
FACTOR C 37 KD 
SUBUNIT) (A 1 37 
KD SUBUNIT) (RF- 
C 37 KD SUBUNIT) 
(RFC37) 


l.OOE-38 


604 


D28116 


Human genes for 
collagen type IV 
alpha 5 and 6, 
exon 1 and exon 


0.39 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1 » 
1 










605 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


606 


AE001149 


Borrelia 
burgdorferi 
(section 35 of 70) 
of the complete 
genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


607 


X14168 


Human pLC46 
with DNA 
replication origin 


6.00E-16 


Z136_HUMAN 


ZINC FINGER 
PROTEIN 136 


0.31 


608 


Z57610 


H.sapiens CpG 
DNA, clone 
187al0, reverse 
read 

cpgl87al0.rtla . 


7.00E-90 


HN3BRAT 


HEPATOCYTE 
NUCLEAR 
FACTOR 3 -BETA 
(HNF-3B) 


1.00E-19 


609 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.043 


PGCV_MOUSE 


VERSICAN CORE 

PROTEIN 

PRECURSOR 

(LARGE 

FIBROBLAST 

PROTEOGLYCAN) 

(CHONDROITIN 

SULFATE 

PROTEOGLYCAN 

CORE PROTEIN 2) 

(PG-M) 


3.5 


610 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


7.00E-07 


CA11_CHICK 


PROCOLLAGEN 
ALPHA 1(1) CHAIN 
PRECURSOR 


0.4 


611 


AB007956 


Homo sapiens 
mRNA, 
chromosome 1 
specific transcript 
KIAA0487 


e-3 06 


RRPBCVMA5 


RNA-DIRECTED 
RNA 

POLYMERASE (EC 
2.7.7.48) (ORF1B) 


9.7 


612 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redund, 


ant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


613 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


UL52_EBV 


HEL1CASE/PR1MA 
SE COMPLEX 
PROTEIN 
(PROBABLE DNA 
REPLICATION 
PROTEIN BSLF1) 


5.9 


614 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 


3.00E-71 


POLG_PVYHU 


GENOME 
POLY PROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


4.3 


615 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


VP3_ROTPC 


INNER CORE 
PROTEIN VP3 


7.7 


616 


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds 


e-143 


GLSLRAT 


GLUTAMINASL, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) (GLS) 


7.00E-67 


617 


Ml 9262 


Rat clathrin light 
chain (LCB3) 
mRNA, complete 
cds. 


0.37 


Y642_METJA 


HYPOTHETICAL 
PROTEIN MJ0642 


5.8 


618 


M21191 


Human aldolase 
pseudogene 
mRNA, complete 
cds. 


1.00E-32 


LINl_NYCCO 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-17 


619 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


NUCM_BOVIN 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 
E 49 KD SUB UNIT 
(EC 1.6.5.3) (EC 
1.6.99.3) 
(COMPLEX 1- 
49KD) (CI-49KD) 


0.044 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


620 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.005 


HEMZRHOCA 


FERROCHELATAS 
E(EC 4.99.1.1) 
(PROTOHEME 
FERRO-LYASE) 


4.4 


621 


AF041428 


Homo sapiens 
ribosomal protein 
s4 X isoform 
gene, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


622 


X07158 


Chironomus 
thummi DNA for 
Cla repetitive 
element 


0.13 


<NONE> 


<NONE> 


<NONE> 


623 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-04 


<NONE> 


<NONE> 


<NONE> 


624 


AF 100470 


Rattus norvegicus 
ri bosom e attached 
membrane protein 
4 (RAMP4) 
mRNA, complete 
cds 


1.00E-53 


<NONE> 


<NONE> 


<NONE> 


625 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 


2.00E-38 


<NONE> 


<NONE> 


<NONE> 


OZO 


M13452 


Human lamin A 
mRNA, 3'end. 


6.00E-16 


<NONE> 


<NONE> 


<NONE> 


627 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.014 


ACDVRAT 


ACYL-COA 
DEHYDROGENAS 
E, VERY-LONG- 
CHAIN SPECIFIC 
PRECURSOR (EC 
1.3.99.-) (VLCAD) 


4.00E-20 


628 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


629 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


630 


U95102 


Xenopus laevis 
mitotic 

phosphopiotein 
90 mRNA, 
complete cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


&EQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


631 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


632 


U95094 


Xenopus laevis 

AL-IINCllINr 

(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


YS83_CAEEL 


HYPOTHETICAL 

oO.y KLJ rKU 1 tlfN 

ZK945.3 IN 
CHROMOSOME II 


0.65 


633 


I U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


NRP_MOUSE 


NEUROPILS 
PROTEIN) 


2.7 


634 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-05 


Y4JN_RHISN 


HYPOTHETICAL 
16.3 KD PROTEIN 
Y4JN 


5.9 


635 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


636 


X64707 


H.sapiens BBC1 
mRNA 


e-179 


RL13_HUMAN 


60S RJBOSOMAL 
PRO I blN LI 5 
(BREAST BASIC 
CONSERVED 
PROTEIN I) 


5.00E-40 


637 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-08 


<NONE> 


<NONE> 


<NONE> 


638 


X14168 


Human pLC46 
with DNA 
replication origin 


5.00E-14 


SP3_HUMAN 


TRANSCRIPTION 
FACTOR SP3 (SPR- 
2) (FRAGMENT) 


0.19 


639 


X 90999 


H lanipns mRNA 

for Glyoxalase II 


9 00E-2O 


GI 02 HUMAN 


HYDROXYACYI G 

111 I— ■* Iv\ — 'A x riv/ l 1_/ V»J 

LUTATHIONE 
HYDROLASE (EC 
3.1.2.6) 


0.007 


640 


AF083322 


Homo sapiens 
centriole 

associated protein 
CEPllOmRNA, 
complete cds 


9.00E-51 


KIF4_MOUSE 


KINES1N-LIKE 
PROTEIN KIF4 


0.005 



265 



BNSDOCID: <WO_9933982A2_L> 




PCT/US98/27610 



Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


641 


Z 12002 


M.musculus Pvt-1 
mRNA. 


0.36 


CP5F_CANTR 


CYTOCHROME 
P450 LIIA6 
(ALKANE- 
INDUCIBLE) (EC 
1.14.14.1) (P450- 
ALK3) 


5.6 


642 


Ml 0206 


R.sphaeroides 
reaction center L 
subunit (complete 
cds) and M 
suDunii ^3 enu ) 
genes. 


1.1 


YGR1_YEAST 


HYPOTHETICAL 
34.8 KD PROTEIN 
IN SUT1-RCK1 
1NTERGENIC 
REGION 


0.006 


643 


K02668 


E. coli ddl gene 
encoding D- 
alanine'D-alanine 
ligase and ftsQ 
and ftsA genes, 
complete cds, and 
ftsZ, gene, 5' end. 


3.3 


ANKB_HUMAN 


ANKYRIN, BRAIN 
VARIANT 1 
(ANKYRIN B) 
(ANKYRIN, 
NONERYTHROID) 


7.00E-07 


644 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


645 


X53616 


C.domesticus 
calnexin (pp90) 
mRNA 


1.1 


<NONE> 


<NONE> 


<NONE> 


646 


Aj/UIU i 


Unman PHI OA 1 

Human k^kjl>z/\ i 
gene for collagen 
II alpha 1 chain, 
exons E2-E15 




PRIO PICi 
i I vi ^ — / r ivj 


MAJOR PRION 
PROTEIN 

PRECURSOR (PRP) 


1.9 


647 




Aenopus ia.evis 
mitotic 

phosphoprotein 
43 mRNA, partial 
cds 


i 1 
i.i 


1 tt 07 HSV2H 


PROTEIN UL7 


7.3 


648 


X52956 


Human CAMII- 

n^i^ PAltn/vliil in 

retropseudogene 


0.37 


PRTP_EBV 


PROBABLE 
PROCESSING AND 
TRANSPORT 
PROTEIN 


7.5 


649 


M93425 


Human protein 
tyrosine 

nhn<inhat?Lse 
(PTP-PEST) 
mRNA, complete 
cds. 


0 


PTNCJiUMAN 


PROTEIN- 
TYROSINE 
PHOSPHATASE Gl 
(EC 3.1.3.48) 
(PTPG1) 


e-107 


650 


L47615 


Mus musculus 
DNA-binding 
protein (Fli-1) 
gene, 5' end of 
cds. 


0.13 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03 IN 
CHROMOSOME I 


2.00E-07 


651 


U60337 


Homo sapiens 
beta-m annos i dase 
mRNA, complete 


0 


GIL1_ENTHI 


GALACTOSE- 
INHIBITABLE 
LECTIN 170 KD 


0.22 
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Nearest Neighbor (BlastX vs. Non-Redund* 


int Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






cds 






SUBUNIT 




652 


U08813 


Oryctolagus 

cuniculus 

Na-H/glucose 

cotransporter- 

related protein 

mRNA, complete 

cds. 


1.00E-22 


NAG1_HUMAN 


SODIUM/GLUCOS 
E 

COTRANSPORTER 
1 (NA(+)/GLUCOSE 
COTRANSPORTER 
1)(HIGH 
AFFINITY 
SODIUM- 
GLUCOSE 
COTRANSPORTER 

) 


0.1 


653 


Y00282 


Human mRNA 
for ribophorin II 


2.00E-78 


RIB2_HUMAN 


DOLICHYL- 
DIPHOSPHOOLIG 
OSACCHA RIDE- 
PROTEIN 

GLYCOSYLTRANS 
FERASE 63 KD 
SUBUNIT 
PRECURSOR (EC 
2.4.1.119) 
(RIBOPHORIN II) 


5.00E-19 


654 


D10051 


Human gene for 
92-kDa type IV 
collagenase, 5- 
flanking region 


0.014 


TAGBDICDI 


PRESTALK- 
SPECIFIC 
PROTEIN TAGB 
PRECURSOR (EC 
3.4.2L-) 


7.6 


655 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 
15, 16 and 17. 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


656 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds 


0 


YG2SYEAST 


HYPOTHETICAL 
69.9 KD PROTEIN 
IN MIC1-SRB5 
INTERGENIC 
REGION 


0.002 


657 


X68792 


S.coelicolor 
A3(2) promoter 
sequence pth270 




I Dou I O/A J 1 


HYPOTHETICAL 
27.0 KD PROTEIN 
IN VAL1-HSP26 
INTERGENIC 
REGION 


0.073 


658 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU INHUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.2 ! 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


659 


U 15522 


Sus scrofa clone 
pvgla Ig heavy 
chain variable 
VDJ region 
mRNA, partial 
cds. 


3.2 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


3.2 


660 


M20918 


C.thummi piger 
haemoglobin (Hb) 
gene DNA, 
complete cds. 


0.12 


YT25CAEEL 


HYPOTHETICAL 
59.9 KD PROTEIN 
B0304.5 IN 
CHROMOSOME II 


0.033 


661 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


662 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.001 


ENVMLVFP 


ENV 

POLYPROTEIN 
PRECURSOR 
(CONTAINS: 
KNOB PROTEIN 
GP70; SPIKE 
PROTEIN PI 5E;R 
PROTEIN) 


3.3 


663 


M97287 


Human 

MAR/SAR DNA 
binding protein 
(SATB1) mRNA, 
complete cds. > :: 
gb|I58691|I58691 
Sequence 1 from 
patent US 
5652340 


0 


SAT1_HUMAN 


DNA-BINDING 
PROTEIN SATB1 
(SPECIAL AT-RICH 
SEQUENCE 
BINDING PROTEIN 
1) 


2.00E-20 


664 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds 


e-168 


K2C4_BOVIN 


KERATIN, TYPE II 
CYTOSKELETAL 
59 KD, 

COMPONENT IV 


4.00E-10 


665 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


e-152 


PLAPJvlOUSE 


PHOSPHOLIPASE 
A-2-ACTIVATING 
PROTEIN (PLAP) 


4.00E-13 


666 


M73047 


Homo sapiens 
tripeptidyl 
peptidase II 
mRNA, complete 
cds. 


0 


MERT_STRLI 


MERCURIC 

TRANSPORT 

PROTEIN 

(MERCURY ION 

TRANSPORT 

PROTEIN) 


4.4 
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SEQ 
ID 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


667 


U09954 


Human ribosomal 
protein L9 gene, 
5' region and 


0 


RL9HUMAN 


60S RIBOSOMAL 
PROTEIN L9 


2.00E-11 


OOO 


X98330 


H.sapiens mRNA 
for ryanodine 
receptor 2 


1.1 


HS74JV10USE 


HEAT SHOCK 70 
KD PROTEIN AGP- 
2 


0.034 


669 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.002 


RPC2JDROME 


DNA-DIRECTED 
RNA 

POLYMERASE III 
128 KD 

POLYPEPTIDE 


1.1 


670 


AF069250 


Homo sapiens 

okadaic acid- 

inducible 

phosphoprotein 

(OA48-18) 

mRNA, complete 

cds 


7.00E-80 


LEGBPEA 


LEGUMIN B 
(FRAGMENT) 


0.011 


671 


Z71419 


S.cerevisiae 
chromosome XIV 
reading frame 
ORF YNL143c 


1.1 


FOCD_ECOLI 


OUTER 
MEMBRANE 
USHER PROTEIN 
FOCD 

PRECURSOR 


9.7 


672 


AF044965 


Homo sapiens 
polio virus related 
protein 2 gene, 
alpha isoform, 
exon 6 and partial 
cds 


e-167 


PVR_MOUSE 


POLIOVIRUS 
RECEPTOR 
HOMOLOG 
PRECURSOR 


1.00E-12 


673 


X65319 


Cloning vector 
pCAT-Enhancer 


2.00E-80 


S106JHUMAN 


CALCYCLIN 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 
PROTEIN) 
CALCIUM- 
BINDING PROTEIN 
A6) 


3 OOE-15 


674 


D29655 


Pig mRNA for 
UMP-CMP 
kinase, complete 
cds 


e-103 


V319_ASFB7 


J3 19 PROTEIN 


4.3 


675 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


VEGRJIAT 


VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR 
RECEPTOR 1 
PRECURSOR 
RECEPTOR FLT) 
(FLT-1) 


3.3 i 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


676 


D90217 


S. cerevisiae gene 

forYmL33, 

mitochondrial 

ribosomal 

proteins of large 

subunit 


2.00E-07 


MALYECOLI 


MALY PROTEIN 
(EC 2.6.1.-) 


5.6 


677 


AF038952 


Homo sapiens 
cofactor A protein 
mRNA, complete 
cds 


e-160 


TICAJVIOUSE 


TCP1- 

CHAPERONIN 
COFACTOR A 


4.00E-19 


678 


Z96950 


Gorilla gorilla 
DNA sequence 
orthologous to the 
human Xp:Yp 
telomere-junction 
region 


5.00E-14 


YHBZ_ECOLI 


HYPOTHETICAL 
43.3 KD GTP- 
BINDING PROTEIN 
IN DACB-RPMA 
INTERGENIC 
REGION (F390) 


3.3 


679 


D50418 


Mouse mRNA for 
AREC3, partial 
cds 


2.00E-79 


CYGXRAT 


OLFACTORY 
GUANYLYL 
CYCLASE GC-D 
PRECURSOR (EC 

4.6.1.2) 


1.1 


680 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-08 


P2C2_SCHPO 


PROTEIN 

PHOSPHATASE 2C 
HOMOLOG 2 (EC 
3.1.3.16) 


1.00E-04 


681 


AL010280 


Plasmodium 

falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-106, complete 
sequence 


0.12 


<NONE> 


<NONE> 


<NONE> 


682 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


5.00E-04 


VSM2TRYBB 


VARIANT 
SURFACE 
GLYCOPROTEIN 
MITAT 1.2 
PRECURSOR (VSG 
221) 


4.3 


683 


U00238 


Homo sapiens 

aliitaminp PI? PP 

amidotransferase 
(GPAT) mRNA, 
complete cds 


0 


<NONE> 


<NONE> 


<NONE> 


684 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


PRPRSALTY 


PROPIONATE 

CATABOLISM 

OPERON 

REGULATORY 

PROTEIN 


1.5 
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SEQ 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


685 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-07 


YAND_SCHPO 


HYPOTHETICAL 
30.4 KD PROTEIN 
C3H1.13IN 
CHROMOSOME I 


0.38 


686 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


687 


U95102 


Xenopus laevis 
mitotic 

Dhosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


Al AA_RAT 


ALPHA- 1 A 
ADRENERGIC 
RECEPTOR (RA42) 


4.4 


688 


L26956 


Mesocricetus 
auratus stearyl- 
CoA desaturase 
sequence 
including male 
hormone 
dependent gene 
derived from 
hamster 
frankorgan 


4.00E-33 


<NONE> 


<NONE> 


<NONE> 


689 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


690 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


Y093_CAEEL 


HYPOTHETICAL 
58.5 KD PROTEIN 
T20B12.3 IN 
CHROMOSOME III 


2.00E-08 


691 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-09 


<NONE> 


<NONE> 


<NONE> 


692 


ABO 17026 


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYBRABIT 


OX Y STEROL- 

£31 IN Ul JN Kj rt\\J I CllN 


1.00E-34 


693 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


UF02_MAIZE 


FLAVONOL 3-0- 
GLUCOSYLTRANS 
FERASE (EC 
2.4.1.91) 


3.1 
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SEQ 
ID 


ACCESSION 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


694 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-04 


<NONE> 


<NONE> 


<NONE> 


695 


U34954 


Caenorhabditis 
elegans 
cyclophilin 
isoform 10 


5.00E-24 


CYPACAEEL 


PEPTIDYL- 
PROLYL CIS- 
TRANS 

ISOMERASE 10 
(EC 5.2.1.8) 


2.00E-29 


696 


AB011167 


Homo sapiens 
mRNA for 
KIAA0595 
protein, partial 
cds 


0 


RFX5HUMAN 


BINDING 

REGULATORY 

FACTOR 


2.1 


697 


U03886 


Human GS2 
mRNA, complete 
cds. 


2.00E-28 


SKDIJvlOUSE 


SKD1 PROTEIN 


4.00E-17 


698 


AF086275 


Homo sapiens full 
length insert 
cDNA clone 
ZD45C02 


3.00E-41 


SPT7_YEAST 


TRANSCRIPTIONA 
L ACTIVATOR 
SPT7 


0.82 


699 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


CA1EHUMAN 


COLLAGEN 
ALPHA 1(XV) 
CHAIN 
PRECURSOR 


1.1 


700 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-11 


E434_ADECC 


Q65962 canine 
adenovirus type 1 
(strain ell), early e4 
31 kd protein. 11/98 


4.4 


701 


LI 7340 


Drosophila 
melanogaster 
germline 
transcription 
factor gene, 
complete cds. 


3.3 


CISY_JTETTH 


CITRATE 
SYNTHASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
4.1.3.7) (14 NM 
FILAMENT- 
FORMING 
PROTEIN) 


9.7 


702 


X58170 


M.musculus 
mrviN/\ ior l- 
Complex Tcp-lOa 
gene 


2.00E-45 


PME2JLYCES 


PECTINESTERASE 
I rK±.(_.UK.MJR(EC 
3.1.1.1 1) (PECTIN 
METHYLESTERAS 
E) (PE 2) 


7.4 


703 


Z96207 


H.sapiens 
telomeric DNA 
sequence, clone 
12PTEL049,read 
12PTELOO049.se 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 
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ID 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






q 










704 


X58430 


Human Hoxl.8 
gene 


e-146 | 


HXAA_HUMAN 


HOMEOBOX 
PROTEIN HOX-A10 
(HOX-lH)(HOX- 
1.8) (PL) 


4.00E-05 


705 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-06 


YN39_SYNP7 


HYPOTHETICAL 
9.2 KD PROTEIN 
IN CYST-CYSR 
INTERGENIC 
REGION (ORF81) 


0.89 


706 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 [ 


MYSH_BOVIN 


MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


0.001 


707 


M19961 


Human 
cytochrome c 
oxidase subunit 
Vb (coxVb) 
mRNA, complete 
cds. 


e-123 


OTHU5B 


<NONE> 


3.00E-30 


708 


X68380 


M.musculus gene 
for cathepsin D, 
exon 3 


5.00E-04 


42_MOUSE 


ERYTHROCYTE 
MEMBRANE 
PROTEIN BAND 
4.2 (P4.2) 
(PALLIDIN) 


9.9 


709 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


1.00E-11 


TCPADROME 


T-COMPLEX 
PROTEIN 1 , 
ALPHA SUBUNIT 
(TCP- 1 -ALPHA) 


4.3 


710 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


711 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-12 


<NONE> 


<NONE> 


<NONE> 


712 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


713 


ABO 18323 


Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 


3.00E-41 


LBRCHICK 


LAM1N B 
RECEPTOR 


3.4 


714 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


YM8LYEAST 


HYPOTHETICAL 
71.1 KD PROTEIN 
IN DSK2-CAT8 
INTERGENIC 
REGION 


3.00E-08 


715 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-13 


PSC_DROME 


POSTERIOR SEX 
COMBS PROTEIN 


0.6 


716 


L28101 


Homo sapiens 
kallistatin (PI4) 
gene, exons 1 -4, 
complete cds 


7.00E-07 


IRKX_RAT 


INWARD 
RECTIFIER 
POTASSIUM 
CHANNEL BIR9 
(KIR5.1) 


5.4 


717 


AC001038 


Homo sapiens 
(subclone 2 h2 
from PI H49) 
DNA sequence 


8.00E-09 


MGMT_YEAST 


METHYLATED- 
DNA— PROTEIN- 
CYSTEINE 
METHYLTRANSFE 
RASE 


0.48 


718 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


YWDE_BACSU 


HYPOTHETICAL 
19.9 KD PROTEIN 
IN SACA-UNG 
INTERGENIC 
REGION 
PRECURSOR 


1.8 


719 


U01139 


Mus musculus 
B6D2F1 clone 
2C11B mRNA. 


e-110 


GSCDROME 


HOMEOBOX 

PROTEIN 

GOOSECOID 


7.2 


720 


AB017430 


Homo sapiens 
mRNA for 
dnesin-like DNA 
binding protein, 
complete cds 


0 


YBAV_ECOLI 


HYPOTHETICAL 
12.7 KD PROTEIN 
IN HUPB-COF 
INTERGENIC 
REGION 


0.17 


721 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.001 


CPCF_SYNP2 


PHYCOCYANOBIL 
IN LYASE BETA 
SUBUNIT(EC 4.-.-.- 


2.4 


722 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


<NONE> 


<NONE> 


<NONE> 
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mt Proteins) 


SEQ 
ID 


ACCESSION 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


723 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.04 


YKK7CAEEL 


HYPOTHETICAL 
54.9 KD rKO 1 hJIN 
C02F5.7 IN 
CHROMOSOME III 


0.057 


724 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


H5_CAIMO 


HISTONE H5 


0.39 


725 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-09 


DED1YEAST | 


PUTATIVE ATP- 
DEPENDENT RNA 
HELICASE DED1 


0.5 


726 


J04617 


Human elongation 
factor EF- 1 -alpha 
gene, complete 
cds. > :: 

dbj|E02629|E0262 
9 DNA of human 
polypeptide chain 
elongation factor- 
1 alpha 


5.00E-36 


ALU7HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 
!!!! 


0.84 


727 


X54859 


Porcine TNF- 
alpha and TNF- 
beta genes for 
tumour necrosis 
factors alpha and 
beta, respectively. 


3.3 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


5.6 


728 


D4991 1 


Thermus 
thermophilus 
UvrA gene, 
complete cds 


0.014 


CC48CAPAN 


CELL DIVISION 
CYCLE PROTEIN 
48 HOMOLOG 


9.9 


72Q 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-06 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 
PRECURSOR 


0.011 


730 


D 15057 


Human mRNA 
for DAD- 1 , 
compieie cus 


0 


DAD 1 _HUM AN 


DEFENDER 
AGAINST CELL 
DEATH 1 <DAD-n 


8.00E-16 


731 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-06 


ANFD_RHOCA 


NITROGENASE 
IRON-IRON 
PROTEIN ALPHA 
CHAIN (EC 
1.18.6.1) 

(NITROGENASE 
COMPONENT I) 
(DINITROGENASE 


9.6 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












) 




732 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 

cds 


7.00E-07 


EFTU_CHLVI 


ELONGATION 
FACTOR TU (EF- 
TU) 


2.5 | 


733 


AB018335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


0 


TRYMRAT 


MAST CELL 
TRYPTASE 
PRECURSOR (EC 
3.4.21.59) 


5.6 


734 


X98743 


H.sapiens mRNA 
for RNA helicase 
(Myc-regulated 
dead box protein) 


0.04 


<NONE> 


<NONE> 


<NONE> 


735 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


736 


Z49314 


S.cerevisiae 
chromosome X 
reading frame 
ORF YJL039c 


3.2 


<NONE> 


<NONE> 


<NONE> 


737 


D 12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


2.00E-76 


T> O 

738 


J04038 


Human 

glyceraIdehyde-3- 

phosphate 

dehydrogenase 


2.00E-47 


SDC INHUMAN 


SYNDECAN-1 

PRECURSOR 

(SYND1)(CD138) 


3.5 


739 


AFO 10238 


Homo sapiens 
von Hippel- 
Lindau tumor 
suppressor 


1 .00E-09 


LIN1HUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


0.001 


740 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


YQJX_BACSU 


HYPOTHETICAL 
13.2 KD PROTEIN 
IN GLNQ-ANSR 
INTERGENIC 
REGION 


9.9 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundi 


int Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


741 


L21186 


Human lysyl 
oxidase-like 
protein mRNA, 
complete cds. 


e-145 


OXRTL 


<NONE> 


1 .00E-34 


742 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-05 


CC48_SOYBN 


CELL DIVISION 
CYCLE PROTEIN 
48 HOMOLOG 
(VALOSIN 
CONTAINING 
PROTEIN 

HOMOLOG) (VCP) 


7.6 


743 


AF009203 


Homo sapiens 
YAC clone 
377A1 unknown 
mRNA, 
3'untranslated 
region 


3.3 


<NONE> 


<NONE> 


<NONE> 


744 


Z74894 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL152w 


0.12 


CD14_RABIT 


Q28680 oryctolagus 
cuniculus (rabbit), 
monocyte 
differentiation 
antigen cd 1 4 
precursor. 1 1/98 


1.9 


745 


1 U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


9.00E-10 


KIN3_YEAST 


SERINE/THREONI 
NE-PROTEIN 
KINASE KIN3 (EC 
2.7.1.-) 


2.5 


746 


! U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-05 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A1 1.03 IN 
CHROMOSOME I 


7.00E-17 


747 


S61044 


ALDH3=aldehyd 
e dehydrogenase 
isozyme 3 
[human, stomach, 
mRNA Partial, 
1362 nt] 


0 


DH A P_HUM AN 


ALDEHYDE 
DEHYDROGENAS 
E, DIMERIC NADP- 
PREFERRING (EC 
1.2.1.5) (CLASS 3) 


2.00E-71 


748 


U95094 


Xenopus laevis 

(XL-INCENP) 
mRNA, complete 
cds 


2.00E-08 


CA1E_CHICK 


COLLAGEN 
ALPHA KXIV^ 
CHAIN 
PRECURSOR 
(UNDULIN) 


0.36 


749 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 


/\ Lt o o o 1 VJlN 




n \/Al T ITT 

r VALUt 


A PPCCCIAXl 

ACCESSION 


DESCRIPTION 


P VALUE 


750 


L14815 


Entamoeba 
histolytica HM- 
1:IMSS galactose- 
specific adhesin 
170kD subunit 
(hgl3) gene, 
complete cds. 


0.12 


<NONE> 


<NONE> 


<NONE> 


751 


X63785 


T.thermophila 
gene for snRNA 
U2-2 


1.1 


<NONE> 


<NONE> 


<NONE> 


752 


M83756 


Mytilus eduiis 
mitochondrial 
NADH 

dehydrogenase 
subunit 5 (ND5) 
gene, 3' end; 
NADH 

dehydrogenase 
subunit 6 (ND6) 
gene, complete 
cds; and 

cytochrome b (cyt 
b), 5' end. 


0.042 


DSC1_HUMAN 


DESMOCOLLIN 
1A/1B 

PRECURSOR 
(DESMOSOMAL 
GLYCOPROTEIN 
2/3)(DG2/DG3) 


2.6 


753 


AB001066 


Brown trout 
microsatellite 
DNA sequence 


0.38 


IMB3_HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


1.2 


754 


AF064787 


Lotus japonicus 
rac GTPase 
activating protein 
1 mRNA, 
complete cds 


0.51 1 


<NONE> 


<NONE> 


<NONE> 


755 


U20608 


Dictyostelium 
discoideum 
unknown spore 
germination- 
specific protein- 
like protein, orfl, 
orf2 and orf3 
genes, complete 
cds 


0.043 


<NONE> 


<NONE> 


<NONE> 


756 


M778I2 


Rabbit myosin 
heavy chain 
mRNA, complete 
cds. 


1.2 


RBL1_HUMAN 


RETINOBLASTOM 
A-LIKE PROTEIN 1 
(107 KD 

RETINOBLASTOM 
A-ASSOCIATED 
PROTEIN) (PRB1) 
(PI 07) 


4.9 
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Nearest Neighbor (BlastX vs. Non-Redund* 


int Proteins) 


SEQ 
ID 

= wr 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


X63789 


T.thermophila 
genes for snRNA 
U5-l,snRNA U5- 
2 


0.058 


<NONE> 


<NONE> 


<NONE> 


758 


D50646 


Mouse mRNA for 
SDF2, complete 
cds 


2.00E-27 


PMT3_YEAST 


DOL1CHYL- 
PHOSPHATE- 
MANNOSE-- 
PROTEIN 

MANNOSYLTRAN 
SFERASE 3 (EC 
2.4.1.109) 


0.002 


759 


L81583 


Homo sapiens 
(subclone 3_g2 
from PI Hll) 
DNA sequence 


3.00E-19 


ALU5_HUMAN 


!!!! ALU 

SUBFAMILY SC 
WARNING ENTRY 
!!!! 


0.86 


760 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


SYFA_YEAST 


PHENYLALANYL- 
TRNA 

SYNTHETASE 
ALPHA CHAIN 
CYTOPLASMIC 


5.7 


761 


AF000370 


Homo sapiens 
polymorphic CA 
dinucleotide 
repeat flanking 
region 


6.00E-89 


APPl_MOUSE 


AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(APLP) 


5.7 


762 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


763 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


PSF_HUMAN 


PTB-ASSOCIATED 
SPLICING FACTOR 
(PSF) 


0.72 


764 


AB018288 


Homo sapiens 
mRNA for 
KIAA0745 
protein, partial 
cds 


0 


TC2ACAEBR 


TRANSPOSABLE 
ELEMENT TCB2 
TRANSPOSASE 


1.5 


765 


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds 


0.38 


PMT2YEAST 


DOL1CHYL- 
PHOSPHATE- 
MANNOSE- 
PROTEIN 

MANNOSYLTRAN 
SFERASE 2 (EC 
2.4.1.109) 


0.18 
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Nearest Neighbor (BiastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


766 


a r?A 1 HI C7 


Oryza sativa low 
molecular early 
light-inducible 
protein mRNA, 
complete cds 


0.38 


RGS3HUMAN 


REGULATOR OF 
G-PROTEIN 
SIGNALLING 3 
(RGS3)(RGP3) 


0.23 


767 


U67599 


Methanococcus 
jannaschii section 
141 of 150 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


768 


X74178 


B.taurus 
microsatellite 
DNA INRA153 


0.13 


FAG1_SYNY3 


P73574 

synechocystis sp. 
(strain pec 6803). 3- 
oxoacyl-[acyl-carrier 
protein] reductase 1 
(ec 1.1.1.100) (3- 
ketoacyl- acyl carrier 
protein reductase 1). 
11/98 


5.00E-16 


769 


AF041858 


Mus musculus 
synaptojanin 2 
isoform delta 
mRNA, partial 
cds 


0.043 


CA44_HUMAN 


COLLAGEN 
ALPHA 4(IV) 
CHAIN 
PRECURSOR 


0.24 


770 


JO 1 404 


Drosophila 
melanogaster 
mitochondrial 
cytochrome c 
oxidase subunits, 
ATPase6, 7 
tRNAs (Trp, Cys, 
Tyr, Leu(UUR), 
Lys, Asp, Gly) 
genes, and 
unidentified 
reading frames 
A61, 2 and 3. 


0.021 


NU1M_CITLA 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 
E CHAIN 1 (EC 
1.6.5.3) 


7.2 


771 


AL022317 


Human DNA 
sequence from 
clone 140L1 on 
chromosome 
22ql3.1-13.31, 
complete 
sequence [Homo 
sapiens] 


3.00E-41 


ALU7_HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 

MM 


4.00E-08 


772 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-09 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 
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DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


773 


AF095927 


Rattus norvegicus 
protein 

phosphatase 2C 
mRNA, complete 
cds 


0 


P2C_PARTE 


PROTEIN 

PHOSPHATASE 2C 
(EC 3. 1.3. 16) (PP2C) 


1.00E-16 


774 


X87212 


H. sapiens mRNA 
for cathepsin C 


0 


CATC_HUMAN 


DIPEPTIDYL- 
PEPTIDASE I 
PRECURSOR (EC 
3.4.14.1) 


2.00E-46 


775 


X05283 


Drosophila 
melanogaster 
PKCG7 gene 
exons 7-14 for 
protein kinase C 


4.5 


<NONE> 


<NONE> 


<NONE> 


776 


X03558 


Human mRNA 
for elongation 
factor 1 alpha 
subunit 


0 


EF1 1 HUMAN 


ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-1 -ALPHA- 1) 


1.00E-83 


777 


X06960 


Aspergillus 

nidulans 

mitochondrial 

DNA for 

cytochrome 

oxidase subunit 3, 

tRNA-Tyr 


0.23 


<NONE> 


<NONE> 


<NONE> 


778 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


YMT8YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP1 16-FAR3 
INTERGENIC 
REGION 


5.00E-07 


779 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


NAT1_YEAST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


5.00E-23 


/oU 


U59706 


Gallus gallus 
alternatively 
spliced AMPA 
glutarnate 
receptor, isoform 
OJuKz nop, 
(GluR2) mRNA, 
partial cds. 


0.014 


PPOL_SARPE 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


0.021 


781 


U57391 


Rattus norvegicus 
FceRI gamma- 
chain interacting 
protein SH2-B 
(SH2-B) mRNA, 
complete cds 


1.00E-84 


<NONE> 


<NONE> 


<NONE> 



281 




PCT/US98/27610 



BNSDOCID: <WO i 



_9933982A2J_> 



WO 99/33982 PCT/US98/27610 

Table 2 





Nearest Neighbor (BlastN vs. Genbank) 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


782 


AB014591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


7.00E-57 


SSGP_VOLCA 


SULFATED 
SURFACE 
GLYCOPROTEIN 
185 (SSG 185) 


5.3 


783 


AJ008065 


Chrysolina bankii 
1 6S rRN A gene, 
mitotype B2 


0.043 


<NONE> 


<NONE> 


<NONE> 


784 


AF067212 


Caenorhabditis 
elegans cosmid 
F37F2 


0.005 


MEK1 JRAT 


MAPK/ERK 
KINASE KINASE 1 
(EC 2.7.1.-) (MEK 
KINASE 1) 


4.5 


785 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.042 


<NONE> 


<NONE> 


<NONE> 


786 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


<NONE> 


<NONE> 


<NONE> 


787 


Y13401 


Homo sapiens 
CD3 delta gene, 
enhancer 
sequence 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


788 


AE001038 


Archaeoglobus 
fulgidus section 
69 of 172 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


789 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


790 


AF041463 


Manihot esculenta 
elongation factor 
1 -alpha 


1.4 


<NONE> ; 


<NONE> 


<NONE> 


791 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


HXA3FLAEIN 


HEME.HEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.7 


792 


Z12112 


pWE15A cosmid 
vector DNA 


3.00E-29 


PKWA_THECU 


PUTATIVE 
SERINE/THREONI 
NE-PROTEIN 
KINASE PKWA 
(EC 2.7.1.-) 


2.00E-04 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


793 


U85193 


Human nuclear 
factor I-R2 
(NFIB2) mRNA, 
complete cds 


4.00E-44 


<NONE> 


<NONE> 


<NONE> 


794 


U89331 


Human 

pseudoautosomal 
hnmpndomain- 

I lUlllwvUWlllUllJ 

containing protein 
(PHOG) mRNA, 
complete cds 


7.00E-06 


NRL_HUMAN 


NEURAL RETINA- 
SPECIFIC 
LEUCINE ZIPPER 
PROTEIN (NRL) 


6.3 


795 


AF055666 


Mus musculus 
kinesin light chain 
2 (Klc2) mRNA, 
complete cds 


0.52 


PSPD_BOVIN 


PULMONARY 
SURFACTANT- 
ASSOCIATED 
PROTEIN D 
PRECURSOR 


0.33 


796 


L13321 


Homo sapiens 
iduronate-2- 
sulfatase (IDS) 
gene, exon 1 , 
incomplete 5' end. 


0.14 


YRP2YEAST 


HYPOTHETICAL 
84.4 KD PROTEIN 
IN RPC2/RET 1 
3'REGION 


0.27 


797 


ALO 10270 


Plasmodium 

falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-96, complete 
sequence 


0.37 


YTH3CAEEL 


HYPOTHETICAL 
75.5 KD PROTEIN 
C14A4.3 IN 
CHROMOSOME II 


2 


798 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.015 


IMB3HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


0.063 


799 


U70139 


Mus musculus 
putative CCR4 
protein mRNA, 
partial cds 


0 


CCR4_YEAST 


GLUCOSE- 

REPRESSIBLE 

ALCOHOL 

DEHYDROGENAS 

E 

TRANSCRIPTION A 
L EFFECTOR 
(CARBON 
CATABOLITE 
REPRESSOR 
PROTEIN 4) 


5.00E-11 


800 


L26507 


Mouse myocyte 
nuclear factor 
(MNF) mRNA, 
complete cds. 


3.00E-41 


MNF_MOUSE 


MYOCYTE 
NUCLEAR 
FACTOR (MNF) 


4.00E-18 
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SEQ 
ID 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


801 


U20527 


Mus musculus 
chemokine KC 
gene, 5* region. 


0 


GRO_MOUSE 


GROWTH REGULATED 
PROTEIN PRECURSOR 
(PLATELET-DERIVED 
GROWTH FACTOR- 
INDUCIBLE PROTEIN 
KC) (SECRETORY 
PROTEIN N51) 


1.00E-28 


802 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


MYSA_DROME 


MYOSIN HEAVY 
CHAIN, MUSCLE 


0.089 


803 


U05823 


Mus musculus 
pericentrin 
mRNA, 
complete cds. 


1.00E-94 


M84D_DROME 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD 


0.099 


804 


U67468 


Methanococcus 
jannaschii 
section 10 of 150 
of the complete 
genome 


0.4 


<NONE> 


<NONE> 


<NONE> 


805 


U14178 


Human type 11 
IL-1 receptor 
gene, exon 1 B 


1.00E-19 


A M PHH U MAN 


AMPHIPHYSIN 


2.9 


806 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


4.00E-86 


807 


D17218 


Human HepG2 
3' region Mbol 
cDNA, clone 
hmd3g02m3 


e-136 


CA1A_HUMAN 


COLLAGEN ALPHA 
1(X) CHAIN 
PRECURSOR 


3.00E-04 


808 


Z57610 


H. sapiens CpG 
DNA, clone 
187a 10, reverse 
read 

cpgl87al0.rtla . 


e-102 


HN3BJVIOUSE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-24 


809 


D 14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds 


0 


NCD_DROME 


CLARET 

SEGREGATIONAL 
PROTEIN 


1.00E-70 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundan 


t Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


810 


X56317 


Xiphophorus 
maculatus 
Xmrk(proto- 
oncogene) gene 
for receptor 
tyrosine kinase. 


0.49 


WNlB_MOUSE 


WNT-10B PROTEIN 
PRECURSOR (WNT-12) 


7.2 


811 


M36200 


Human 

synaptobrevin 1 
(SYBl)gene, 
exon 5. 


0.2 


VE2_HPV14 


REGULATORY 
PROTEIN E2 


3.1 


812 


M18157 


Human glandular 
callikrein gene, 
complete cds. 


1.5 


EKLFMOUSE 


ERYTHROID 
KRUEPPEL-LIKE 
TRANSCRIPTION 
FACTOR (EKLF) 


1.1 


813 


D25215 


Human mRNA 
for KIAA0032 
gene, complete 
cds 


1.9 


YXIS_SACER 


HYPOTHETICAL 28.9 
KD PROTEIN IN XIS 
5'REGION (ORF1) 


l .J 


814 


M96628 


Human gene 
sequence, 5' end. 


2.00E-06 


AGRI_DISOM 


AGRIN (FRAGMENT) 


9.5 


815 


Z57610 


H.sapiens CpG 
DNA, clone 
187al0, reverse 
read 

cpgl87al0.rtla . 


e-102 


HN3BJvlOUSE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-19 


816 


X14168 


Human pLC46 
with DNA 
replication origin 


5.00E-16 


ZN44_HUMAN 


ZINC FINGER PROTEIN 
44 (ZINC FINGER 
PROTEIN KOX7) 


1 .o 


817 


Ml 9262 


Ratclathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.28 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


A 1 1 


818 


AF058055 


Mus musculus 
monocarboxylate 
transporter 1 


0.2 


<NONE> 


<NONE> 


<NONE> 


819 


ABO 14570 


Homo sapiens 
mRNA for 
K1AA0670 
protein, partial 
cds 


0.16 


YGR1 YEAST 


HYPOTHETICAL 34.8 

t/- T\ nn ATC1X1 IXT C I TT*1 

KD FKU 1 blN IN bU 1 1 - 
RCK1 INTERGENIC 
REGION 


4.00E-06 


820 


Ml 9262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.27 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.5 j 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


821 


| Z54367 


H.sapiens gene 
for plectin 


0.29 


Y093_CAEEL 


HYPOTHETICAL 58.5 
KD PROTEIN T20B 1 2.3 
IN CHROMOSOME III 


1.00E-14 


822 


ABO 17026 


Mus musculus 
mRNA fnr 

III IX 1 ^< 1\J1 

oxysterol- 
binding protein, 
complete cds 


0 


OXYBHUMAN 


OXYSTEROL-BINDING 


2.00E-49 


823 


X58170 


M.musculus 
mRNA for t- 
Complex Tcp- 
10a gene 


1.00E-20 


UL52_HSV11 


DNA 

HELICASE/PRIMASE 
COMPLEX PROTEIN 
(DNA REPLICATION 
PROTEIN UL52) 


5.3 


824 


X58430 


Human Hoxl.8 
gene 


0 


HXAAHUMAN 


HOMEOBOX PROTEIN 

HOX-AIO(HOX-IH) 

(HOX-1.8)(PL) 


1.00E-44 


825 


X53754 


Porcine 

sarcoplasmic/end 
oplasmic- 
reticulum Ca(2+) 
pump gene 2 3'- 
end region 


1.3 


<NONE> 


<NONE> 


<NONE> 


826 


AB005786 


Arabidopsis 
thaliana tRNA- 
Glu gene 


0.46 


<NONE> 


<NONE> 


<NONE> 


827 


AB012130 


Homo sapiens 
SBC2 mRNA for 
sodium 
bicarbonate 
cotransporter2, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


828 


AB017430 


Homo sapiens 
mRNA for 
kinesin-like 
DNA binding 
protein, complete 
cds 


0 


YBAV_ECOLI 


HYPOTHETICAL 12.7 
KD PROTEIN IN HUPB- 
COF INTERGENIC 
REGION 


0.063 


829 


AB007886 


Homo sapiens 
KIAA0426 
mRNA, 
complete cds 


0.042 


YDF3_SCHPO 


PROBABLE 
EUKARYOTIC 
INITIATION FACTOR 
C17C9.03 


0.52 


830 


AB018335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


e-172 


UROT_BOVIN 


TISSUE PLASMINOGEN 
ACTIVATOR 
PRECURSOR (EC 
3.4.21.68) 


0.86 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundan 


t Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


831 


D 12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


9.00E-96 


832 


U38376 


Rattus 
norvegicus 
cytosolic 
phospholipase 
A2 mRNA, 
complete cds 


0.048 


<NONE> 


<NONE> 


<NONE> 


833 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


h.kJkJll-ok) 


834 


U08110 


Mus musculus 
RNA1 homolog 
(Fugl)mRNA, 


8.00E-04 


YNW7_YEAST 


HYPOTHETICAL 68.8 
KD PROTEIN IN URE2- 
SSU72 INTERGENIC 
REGION 


0.02 




D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


836 




Mrm^p mRNA 

IVIUUjv Illlvl lii 

for SDF2, 
complete cds 


1 .00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


837 


U67459 


Methanococcus 
jannaschii 
section 1 of 1 50 
of the complete 
genome 


5.00E-05 


GCS INHUMAN 


MANNOSYL- 
OLIGOSACCHARIDE 
GLUCOSIDASE(EC 
3.2.1.106) 


9.2 


838 


U 18657 


Haemophilus 
influenzae LeuA 
(leuA) gene, 
partial cds, DprA 
(dprA+), orf272 
and orfl93 
genes, complete 
cds, and PfkA 
(pfkA) gene, 
partial cds. 


0.01 


STE6_YEAST 


MATING FACTOR A 
SECRETION PROTEIN 
STE6 (MULTIPLE 
DRUG RESISTANCE 
PROTEIN HOMOLOG) 
(P-GLYCOPROTEIN) 


7 
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Table 2 





Nearest Neighbor (BlastN vs, Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


839 


! U 12523 


Rattus 
norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence- 


1.00E-10 


YMT8_YEAST 


HYPOTHETICAL 36.4 
KD PROTEIN IN 
NUP1 16-FAR3 
INTERGENIC REGION 


2.00E-06 


840 


D78255 


Mouse mRNA 
for PAP-1, 
complete cds 


e-175 


<NONE> 


<NONE> 


<NONE> 


841 


D 17263 


Human HepG2 
3' region Mbol 
cDNA, clone 
hmd5f07m3 


1.00E-58 


<NONE> 


<NONE> 


<NONE> 


842 


AF006751 


Homo sapiens 
ES/130mRNA, 
complete cds 


0.061 


YRP2YEAST 


HYPOTHETICAL 84.4 
KD PROTEIN IN 
RPC2/RET1 3'REGION 


2.00E-07 


843 


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


6.00E-05 


YC14_METJA 


HYPOTHETICAL 
PROTEIN MJ1214 


8.1 


844 


D88689 


Mus musculus 
mRNA for flM, 
complete cds 


0.084 


ICP0JHSV2H 


TRANS-ACTING 
TRANSCRIPTIONAL 
PROTEIN ICP0 
(VMW1 18 PROTEIN) 


0.014 



288 



BNSDOCID: <WO 9933982A2J_> 




WO 99/33982 
Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


M00001340B:A06 


17062 


3 


0 


0 


M00001340D:F10 


11589 


2 


2 


1 


M00001341A:E12 


4443 


10 


6 


2 


M00001342B:E06 


39805 


2 


0 


0 


M00001343C:F10 


2790 


7 


15 


13 


M00001343D:H07 


23255 


3 


0 


1 


M00001345A:E01 


6420 


8 


0 


2 


M00001346A:F09 


5007 


4 


8 


3 


M00001346D:E03 


6806 


5 


2 


1 


M00001346D:G06 


5779 


5 


4 


3 


M00001346D:G06 


5779 


5 


4 


3 


M00001347A:B10 


13576 


5 


0 


0 


M00001348B:B04 


16927 


4 


0 


0 


M00001348B:G06 


16985 


4 


0 


0 


M00001349B:B08 


3584 


5 


11 


5 


MO00O1350A:H01 


7187 


5 


3 


1 


M00001351B:A08 


3162 


10 


14 


1 


M00001351B:A08 


3162 


10 


14 


1 


M00001352A:E02 


16245 


4 


0 


0 


MO0OO1353A:G12 


8078 


4 


3 


1 


M00001353D:D10 


14929 


4 


0 


0 


M00001355B:G10 


14391 


3 


1 


0 


M00001357D:D11 


4059 


8 


6 


8 


M00001361A:A05 


4141 


5 


2 


10 


MO0OO1361D:FO8 


2379 


26 


13 


4 


M00001362B:D10 


5622 


7 


4 


2 


M00001362C:H11 


945 


9 


21 


2 


MO0O01365C:C10 


40132 


2 


0 


0 


M00001370A:C09 


6867 


7 


3 


0 


M00001371C:E09 


7172 


3 


5 


1 


M00001376B:G06 


17732 


1 


3 


5 


M00001378B-.B02 


39833 


2 


0 


0 


M00001379A:A05 


1334 


27 


38 


35 


M00001380D:B09 


39886 


2 


0 


0 


M00001382C:A02 


22979 


2 


1 


0 


MO0O01383A:CO3 


39648 


2 


0 


0 


M00O01383A:CO3 


39648 


2 


0 


0 


M00001386C:B12 


5178 


5 


5 


4 


M00001387A:C05 


2464 


5 


19 


25 


M00O01387B:GO3 


7587 


6 


2 


1 


MO0O01388D:GO5 


5832 


10 


3 


0 


M00001389A:C08 


16269 


3 


0 


0 


M00001394A:F01 


6583 


2 


7 


3 


M00001395A:C03 


4016 


5 


14 


0 


M00001396A:C03 


4009 


6 


4 


13 


M00001402A:E08 


39563 


2 


0 


0 
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Clones in Clones in Clones in 



Lib4 


Lib8 


Lib9 


0 


0 


0 


3 


3 


8 


6 


3 


11 


0 


1 


0 


14 


6 


0 


1 


0 


0 


0 


1 


0 


6 


2 


6 


2 


0 


3 


4 


0 


0 


4 


0 


0 


0 


12 


11 


2 


0 


0 


0 


0 


0 


0 


0 


2 


0 


1 


0 


6 


6 


5 


6 


6 


5 


0 


0 


0 


0 


1 


0 


1 


23 


16 


0 


0 


0 


16 


0 


1 


16 


4 


27 


2 


2 


3 


13 


1 


2 


1 


0 


0 


0 


3 


0 


0 


0 


0 


2 


0 


1 


0 


1 


4 


0 


0 


0 


28 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


2 


5 


2 


16 


1 


0 


0 


0 


0 


1 


5 


0 


0 


1 


1 


2 


0 


0 


6 


0 


0 


5 


4 


10 


0 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


f^lnm** in 

V^IUIICS III 


Plnnps in 
V^-IvIlC^ III 


PlnnM in 
V^lUJlvd III 


Clones in 


clones in 


clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 


M-iiMJO 




M00001407B:D11 


5556 


8 


1 


5 


0 


2 


o 


M00001409C:D12 


9577 


5 


2 


0 


1 


1 1 


12 


M00001410A:D07 


7005 


8 


2 


0 


0 


0 


o 


M00001412B:B10 


8551 


4 


4 


0 


3 


0 


o 


M00001415A:H06 


13538 


5 


0 


0 


0 


9 


1 


M00001416A:H01 


7674 


5 


2 


0 


5 


0 


o 


M00001416B:H11 


8847 


4 


1 


3 


0 


6 


1 


M00001417A:E02 


36393 


2 


0 


0 


1 


o 


o 


M00001418B:F03 


9952 


4 


2 


1 


1 


o 


o 


M00001418D:B06 


8526 


3 


2 


1 


5 


I 


o 


M00001421C:F01 


9577 


5 


2 


o 


1 


1 1 

1 1 


1? 


M00001423B:E07 


15066 


4 


0 


o 


o 




o 


M00001424B:G09 


10470 


5 


1 


o 


2 


o 


1 

1 


M00001425B:H08 


22195 


3 


0 


o 


o 


0 




M00001426D:C08 


4261 


4 


9 


7 


o 


1? 


i <; 


M00001428A:H10 


84182 


1 


0 


o 


o 


0 




M00001429A:H04 


2797 


15 


1 1 


18 


16 


1 

1 


1 4 


M00001429B:A11 


4635 


7 


9 


2 


o 


o 


o 


M00001429D:D07 


40392 


2 


0 


1 


o 


12 


1 A 

1 o 


M00001439C:F08 


40054 


1 


o 


o 


0 


o 


0 


M00001442C:D07 


16731 


3 


1 


o 


o 


o 




M00001445A:F05 


13532 


3 


2 


1 


o 


1 

1 


2 


M00001446A:F05 


7801 


5 


2 


4 




1 

1 


0 


M00001447A:G03 


10717 


7 


2 


0 


5 


g 




M00001448D:C09 


8 


1850 


2127 


1703 


3133 


i ~> j ~j 


1 22 


M00001448D:H01 


36313 


2 


0 


0 


o 


\ 


10 


M00001449A:A12 


5857 


6 


2 


3 


4 


o 


o 

w 


M00001449A:B12 


41633 


1 


1 


0 


o 


o 


o 


M00001449A:D12 


3681 


12 


5 


10 


1 


2 




M00001449A:G10 


36535 


2 


0 


o 


o 


o 


o 


M00001449C:D06 


86110 


1 


0 


o 


o 


o 


o 


M00001450A:A02 


39304 


2 


0 


o 


o 


o 


o 


M00001450A:A11 


32663 


1 


1 


0 


o 


o 


o 


M00001450A:B12 


82498 


1 


0 


0 


0 


o 


o 


M00001450A:D08 


27250 


2 


0 


0 


0 


o 


o 


M00001452A:B04 


84328 


1 


0 


0 


0 


o 


o 


M00001452A:B12 


86859 


1 


0 


0 


0 


o 


o 


M00001452A:D08 


1120 


44 


41 


5 


11 


5 


o 


M00001452A:F05 


85064 


1 


0 


0 


0 


0 


o 


M00001452C:B06 


16970 


4 


0 


0 


0 


3 


4 


M00001453A:E11 


16130 


3 


1 


0 


0 


0 


1 


M00001453C:F06 


16653 


3 


1 


0 


0 


0 


0 


M00001454A:A09 


83103 


1 


0 


0 


0 


0 


0 


M00001454B:C12 


7005 


8 


2 


0 


0 


0 


0 


MO0O01454D:G03 


689 


58 


95 


17 


36 


66 


95 


M00001455A:E09 


13238 


4 


1 


0 


0 


0 


0 


M00001455B:E12 


13072 


4 


1 


0 


0 


0 


0 


M00001455D:F09 


9283 


4 


1 


0 


1 


0 


1 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



f Inne Name 

11/11 C 1 1 U 111 v- 


Cluster 


Clones in 


Clones 




ID 


Libl 


Lib2 


M00001455D:F09 


9283 


4 


1 


M00001460A:F06 


2448 


23 


22 


M00001460A:F12 


39498 


2 


0 


M00001461A:D06 


1531 


20 


23 


M00001463C:B11 


19 


1415 


1203 


M00001465A:B11 


10145 


2 


0 


M00001466A:E07 


4275 


11 


2 


M00001467A:B07 


38759 


2 


0 


M00001467A:D04 


39508 


2 


0 


M00001467A:D08 


16283 


3 


0 


M00001467A:D08 


16283 


3 


0 


M00001467A:E10 


39442 


2 


0 


M00001468A:F05 


7589 


6 


2 


M00001469A:C10 


12081 


4 


0 


M00001469A:H12 


19105 


2 


0 


M00001470A:B10 


1037 


53 


48 


M00001470A:C04 


39425 


2 


0 


M00001471A:B01 


39478 


2 


0 


M00001481D:A05 


7985 


3 


1 


M00001490B:C04 


18699 


2 


1 


M00001494D:F06 


7206 


4 


3 


M00001497A:G02 


2623 


12 


4 


M00001499B:A11 


10539 


2 


1 


M00001500A:C05 


5336 


9 


2 


M00001500A:E11 


2623 


12 


4 


M00001500C:E04 


9443 


4 


2 


M00001501D:C02 


9685 


3 


2 


M00001504C:A07 


10185 


5 


1 


M00001504C:H06 


6974 


7 


3 


M00001504D:G06 


6420 


8 


0 


M00001507A:H05 


39168 


2 


0 


M00001511A:H06 


39412 


2 


0 


M00001512A:A09 


39186 


2 


0 


M00001512D:G09 


3956 


9 


9 


M0O001513A:B06 


4568 


10 


4 


M00001513C:E08 


14364 


1 


0 


M00001514C:D11 


40044 


2 


0 


M00001517A:B07 


4313 


13 


6 


M00001518C:B11 


8952 


3 


4 


M00001528A:C04 


7337 


4 


4 


M00001528A:F09 


18957 


3 


0 


M00001528B:H04 


8358 


3 


3 


M00001531A:D01 


38085 


2 


0 


M00001532B:A06 


3990 


6 


12 


M00001533A:C11 


2428 


14 


14 


M00001534A:C04 


16921 


4 


0 


M00001534A:D09 


5097 


6 


5 


M00001534A:F09 


5321 


11 


7 
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lones in 


Clones in 


Clones in 


Clones 


Lib3 


Lib4 


Lib8 


Lib9 


0 


1 


0 


1 


2 


3 


3 


1 


0 


0 


0 


0 


32 


17 


14 


14 


1364 


525 


479 


774 


2 


0 


0 


0 


5 


0 


4 


2 


0 


0 


1 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


1 


1 


0 


0 


0 


0 


0 


2 


0 


1 


0 


4 


22 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


4 


0 


1 


0 


0 


0 


0 


3 


3 


1 


2 


0 


31 


4 


6 


1 


1 


0 


1 


0 


4 


8 


3 


15 


31 


4 


6 


1 


1 


1 


0 


0 


0 


7 


2 


3 


0 


0 


2 


4 


0 


1 


0 


0 


2 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


5 


2 


0 


0 


0 


9 


2 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


1 


0 


0 


4 


2 


0 


3 


16 


12 


21 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


0 


0 


4 


1 


3 


1 


13 


9 


2 


19 


0 


1 


2 


1 


1 


1 


3 


2 


1 


5 


10 


26 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


f^ifknpc in 

VlUlICo 111 


Inn ac in 


v^ioncs in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 






M00001534C:A01 


4119 


9 


4 


2 


2 


5 


3 


M00001535A:B01 


7665 


3 


1 


5 


0 


o 


o 


M00001535A:C06 


20212 


2 


0 


1 


1 


o 


o 


M00001535A:F10 


39423 


2 


0 


0 


0 


o 


o 


M00001536A:B07 


2696 


23 


1 1 


9 


18 


10 


21 


M00001536A:C08 


39392 


2 


0 


0 


0 


o 


o 


M00001537A:F12 


39420 


2 


0 


0 


0 


o 


o 


M00001537B:G07 


3389 


4 


11 


13 


2 


o 


o 


M00001540A:D06 


8286 


6 


1 


0 


3 


4 


o 


M00001541A:D02 


3765 


19 


6 


0 


0 


o 


o 


M0OOOI541A:FO7 


22085 


3 


0 


0 


0 


o 


1 

1 


M00001541A:H03 


39174 


2 


0 


0 


o 


n 


o 


M00001542A:A09 


22113 


3 


0 


o 


o 


o 


o 


M00001542A:E06 


39453 


2 


o 


0 


o 




n 


M00001544A:E03 


12170 


2 


1 


2 


o 


o 


n 


M00001544A:G02 


19829 


2 


0 


1 


o 


o 




M00001544B:B07 


6974 


7 


3 


o 


1 


o 


n 


M00001545A:C03 


19255 


2 


o 


0 


o 




A 
U 


M00001545A:D08 


13864 


3 


o 


2 


1 




A 

w 


M00001546A:G11 


1267 


43 


55 


5 


o 




0 


M00001548A:E10 


5892 


5 


1 


4 


4 


1 
i 


"2 
-> 


M00001548A:H09 


1058 


40 


44 


37 


47 




SQ 


M00001549A:B02 


4015 


10 


5 


8 


15 


2 


o 


M00001549A:D08 


10944 


3 


0 


3 


1 


n 


7 


M00001549B:F06 


4193 


12 


7 


2 


2 


n 


1 
■ 


M00001549C:E06 


16347 


4 


0 


0 


0 


o 


o 


M00001550A:A03 


7239 


5 


2 


1 


o 


2 




M00001550A:G01 


5175 


8 


1 


3 


2 


o 


o 


M00001551A:B10 


6268 


6 


4 


3 


18 


5 


o 


M00001551A:F05 


39180 


2 


0 


0 


0 


o 


o 


M00001551A:G06 


22390 


2 


1 


0 


0 


o 


1 


M00001551C:G09 


3266 


12 


14 


0 


1 


o 


6 


M00001552A:B12 


307 


73 


60 


196 


75 


79 


27 


M00001552A:D11 


39458 


2 


0 


0 


0 


0 


o 


M00001552B:D04 


5708 


5 


4 


4 


3 


1 


4 


M00001553A:H06 


8298 


4 


3 


1 


3 


0 


o 


M00001553B:F12 


4573 


5 


7 


2 


5 


0 


1 


M00001553D:D10 


22814 


3 


0 


0 


0 


0 


o 


M00001555A:B02 


39539 


2 


0 


0 


0 


1 


o 


M00001555A:C01 


39195 


2 


0 


0 


0 


0 


o 


M00001555D:G10 


4561 


8 


4 


4 


8 


0 


0 


M00001556A:C09 


9244 


2 


0 


3 


2 


10 


17 


M00001556A:F11 


1577 


12 


40 


25 


3 


4 


0 


M00001556A:H01 


15855 


2 


1 


1 


2 


12 


213 


M00001556B:C08 


4386 


7 


8 


3 


1 


3 


21 


M00001556B:G02 


11294 


4 


0 


2 


0 


0 


1 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



iodine 


Cluster 


Clones in 


Clones 




ID 


Libl 


Lib2 


M00001557A:F01 


9635 


3 


0 


M00001557A:F03 


39490 


2 


0 


M00001557B:H10 


5192 


8 


5 


M00001557D:D09 


8761 


3 


4 


M00001558B:H11 


7514 


5 


3 


M00001560D:F10 


6558 


4 


3 


MOO001561A:CO5 


39486 


2 


0 


M00001563B:F06 


102 


289 


233 


M00001564A:B12 


5053 


11 


4 


M00001571C:H06 


5749 


4 


1 


M00001578B:E04 


23001 


2 


1 


M00001579D:C03 


6539 


8 


3 


MOOO01583D:A10 


6293 


3 


5 


M00001586C:C05 


4623 


3 


4 


M00001587A:B11 


39380 


2 


0 


M00001594B:H04 


260 


189 


188 


M00001597C:H02 


4837 


6 


2 


M00001597D:C05 


10470 


5 


1 


M00001598A:G03 


16999 


4 


0 


M00001601A:D08 


22794 


2 


0 


M00001604A:B10 


1399 


49 


27 


M00001604A:F05 


39391 


2 


0 


M00001607A:E1 1 


11465 


5 


0 


M00001608A:B03 


7802 


5 


4 


M00001608B:E03 


22155 


3 


0 


M00001614C:F10 


13157 


4 


1 


M00001617C:E02 


17004 


4 


0 


M00001619C:F12 


40314 


2 


0 


M00001621C:C08 


40044 


2 


0 


M00001623D:F10 


13913 


2 


1 


M00001624A:B06 


3277 


10 


11 


M00001624C:F01 


4309 


4 


13 


M00001630B:H09 


5214 


10 


2 


M00001644C:B07 


39171 


2 


0 


M00001645A:C12 


19267 


2 


0 


M00001648C:A01 


4665 


5 


9 


M00001657D:C03 


23201 


3 


0 


M00001657D:F08 


76760 


1 


0 


M00001662C:A09 


23218 


3 


0 


M00001663A:E04 


35702 


2 


0 


M00001669B:F02 


6468 


4 


3 


M00001670C:H02 


14367 


3 


0 


M00001673C:H02 


7015 


6 


3 


M00001675A:C09 


8773 


4 


1 


M00001676B:F05 


114.60 


4 


2 


M00001677C:E10 


14627 


1 


2 


M00001677D:A07 


7570 


5 


3 


M00001678D:F12 


4416 


9 


5 




PCT/US98/27610 



lones in 


Clones in 


Clones in 


Clones 


Lib3 


Lib4 


Lib8 


Lib9 


2 


1 


0 


0 


0 


0 


1 


0 


0 


5 


0 


0 


0 


1 


0 


1 


0 


0 


0 


0 


4 


0 


0 


5 


0 


0 


0 


0 


278 


116 


123 


184 


2 


2 


1 


1 


9 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


1 


2 


6 


0 


0 


12 


2 


1 


1 


0 


0 


0 


0 


27 


2 


15 


0 


10 


0 


3 


1 


0 


2 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


19 


7 


10 


23 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


3 


1 


0 


1 


0 


1 


0 


0 


0 


1 


0 


0 


0 


0 


0 


2 


0 


0 


1 


8 


3 


5 


1 


3 


10 


0 


0 


2 


2 


4 


3 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


3 


0 


2 


2 


0 


5 


0 


0 


0 


0 


0 


0 


0 


0 


3 


8 


1 


0 


0 


0 


0 


0 


1 


2 


1 


1 


4 


4 


4 


6 


0 


0 


0 


0 


1 


0 


1 


0 


0 


0 


0 


0 


2 


6 


1 


3 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



i^ione ixamc 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones 




in 


I lhl 


I 5K9 
LIDX 


LID3 


L.ib4 


Lib8 


Lib9 


M00001679A:A06 


6660 


7 


o 




9 
z 


i 
1 


0 


M00001679A:F10 


26875 


] 


o 


A 


A 

u 


i 
1 


0 


M00001679B:F01 


6298 


2 


4 


J 




i 
l 


/% 
U 


M00001679C:F01 


78091 


1 


o 


A 
U 


A 
U 


A 


0 


M00001679D:D03 


10751 


3 


z. 


A 

u 


1 
1 


V 


i 
I 


M00001679D:D03 


10751 


3 


Z 


A 
U 


1 


(J 


1 


M00001680D:F08 


10539 


2 


1 
1 


1 
1 


A 
U 


1 

1 


r\ 
0 


M00001682C:B12 


17055 


4 


o 


A 
\J 


A 
U 


A 


U 


M00001686A:E06 


4622 


7 


t> 


A 


z 


3 


0 


M00001688CF09 


5382 


6 


9 
z 


/: 
O 


z 


0 


3 


M00001693C-G01 


4393 


1 0 


O 


Z 


4 


t 

1 


1 


M00001716DH05 

1 ▼ A WW 1 / A UL/«1 Xv*y 


67252 


1 
i 




A 

u 


1 

1 


0 


0 


M0000374 1 DC09 




Z 


u 


A 

u 


0 


0 


0 


M00003747DC0S 


1 1476 


f. 

o 


A 

u 


A 


0 


0 


0 


M00003759R-R09 


697 


76 


^9 


1 A 


72 


21 


30 


M00003762C-R08 


1 7076 




n 
u 


U 


0 


0 


0 


M00003761 A *F06 


1 1 08 


14 
1 H 


1 1 


/ 


5 


0 


1 


M00003774C-A03 


67Q07 


1 
1 


u 


r\ 
U 


0 


0 


0 


M00003796C-nOS 


S619 


-2 
-> 


c 


3 


3 


0 


4 


M00003826RAO6 


1 1 3S0 


-> 


-> 


U 


0 


1 


0 


M00003833 A *F0S 


? 1 877 


9 
z 


1 
1 


U 


0 


0 


1 


M00003837DA01 


7899 


-> 




0 


2 


1 


0 


M00003839A *D08 


7798 




9 


1 


0 


0 


1 


M00003844CR1 1 


6S39 


o 
o 


J 


r\ 
U 


a 
0 


0 


1 


M00003846R D06 


6874 


u 




A 


u 


0 


0 


M00003851RD10 


1 3S9S 


A 
*+ 


A 
U 


1 


A 

0 


0 


1 


M00003853 A D04 


S619 






-5 

3 


-J 


0 


4 


M00003853AF12 






1 
1 


A 

u 


1 


1 


2 


M00003856B:C02 


4622 


7 


U 


/I 


-> 
z 


-> 

3 


0 


M00003857AG 1 0 


3389 


4 


1 1 

1 1 


1 1 
1 J 


z 


0 


0 


M00003857AH03 


4718 


4 




c 


z 


4 


6 


M00003871C:E02 


4573 




7 


9 




u 


1 


M00003875B:F04 


12977 


5 




A 

u 


n 
U 


0 


0 


M00003875B:F04 


12977 


5 


o 


n 
w 


A 
U 


A 


A 


M00003875C:G07 


8479 


4 


-x 


1 
1 


1 
J 


z 


4 


M00003876D:E12 


7798 


5 


9 

z. 


9 
z 


n 
u 


A 


l 


M00003879B:C1 1 


5345 


7 


i 


7 

/ 


A 


o 


2.1 


MOOOO3879B:D10 


31587 


1 




n 


A 
U 


1 
1 


A 


M00003879D:A02 


14507 


3 


1 


o 

u 


A 
U 


3 


i 
1 


M00003885C:A02 


13576 


5 


o 


A 


A 
U 


Iz 


I I 


M0O003885C:AO2 


13576 


5 


0 


o 


o 


12 


1 1 
1 I 


M00003906C:E10 


9285 


4 


3 


0 


0 


1 


2 


M00003907D:A09 


39809 


1 


0 


0 


0 


2 


1 


M00003907D:H04 


16317 


3 


0 


0 


0 


0 


0 


M00003909D:C03 


8672 


4 


4 


0 


0 


0 


0 


M00003912B:D01 


12532 


4 


1 


0 


1 


0 


1 


M00003914C:F05 


3900 


9 


6 


8 


1 


7 


13 


M00003922A:E06 


23255 


3 


0 


1 


1 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00003958A:H02 


18957 


3 


0 


0 


0 


0 


0 


M00003958A:H02 


18957 


3 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


2 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


2 


0 


0 


0 


0 


0 


M00003968B:F06 


24488 


2 


0 


1 


4 


0 


0 


M00003970C:B09 


40122 


2 


0 


0 


0 


0 


0 


M00003974D:E07 


23210 


3 


0 


0 


0 


0 


0 


M00003974D:H02 


23358 


3 


0 


0 


0 


1 


0 


M00003975A:G11 


12439 


4 


0 


0 


0 


0 


0 


M00003978B:G05 


5693 


7 


4 


1 


3 


1 


1 


M00003981A:E10 


3430 


9 


10 


7 


3 


0 


0 


M00003982C:C02 


2433 


10 


13 


21 


18 


8 


8 


M00003983A:A05 


9105 


5 


1 


1 


1 


0 


0 


M00004028D:A06 


6124 


4 


8 


1 


9 


1 


0 


M00004028D:C05 


40073 


2 


0 


1 


0 


0 


1 


M00004031A:A12 


9061 


5 


2 


0 


0 


0 


0 


M00004031A:A12 


9061 


5 


2 


0 


0 


0 


0 


M00004035C:A07 


37285 


2 


0 


0 


1 


0 


1 


M00004035D:B06 


17036 


4 


0 


0 


0 


0 


0 


M00004059A:D06 


5417 


10 


4 


0 


9 


2 


0 


M00004068B:A01 


3706 


7 


14 


4 


22 


1 


0 


M00004072B:B05 


17036 


4 


0 


0 


0 


0 


0 


M00004081C:D10 


15069 


3 


0 


0 


1 


0 


0 


M00004081C:D12 


14391 


3 


1 


0 


0 


0 


0 


M00004086D:G06 


9285 


4 


3 


0 


0 


1 


2 


M00004087D:A01 


6880 


2 


6 


1 


1 


0 


0 


M00004093D:B12 


5325 


5 


5 


2 


0 


2 


1 


M00004093D:B12 


5325 


5 


5 


2 


0 


2 


1 


M00004105C:A04 


7221 


5 


2 


2 


2 


0 


0 


M00004108A:E06 


4937 


4 


9 


3 


1 


3 


1 


M00004111D:A08 


6874 


6 


3 


0 


0 


0 


0 


M00004114C:F11 


13183 


2 


3 


0 


7 


0 


1 


M00004138B:H02 


13272 


3 


2 


0 


3 


0 


0 


M00004146C:C11 


5257 


2 


8 


5 


5 


5 


25 


M00004151D:B08 


16977 


4 


0 


0 


0 


0 


0 


M00004157C:A09 


6455 


3 


1 


6 


0 


0 


0 


M00004169C:C12 


5319 


6 


2 


8 


2 


2 


3 


M00004171D:B03 


4908 


6 


7 


2 


2 


2 


0 


M00004172C:D08 


11494 


4 


0 


0 


0 


0 


0 


M00004183C:D07 


16392 


3 


0 


0 


0 


0 


0 


MOO004185C:CO3 


11443 


5 


1 


0 


0 


0 


0 


M00004197D:H01 


8210 


2 


6 


0 


0 


0 


0 


M00004203B:C12 


14311 


4 


0 


0 


0 


1 


2 


M00004212B:C07 


2379 


26 


13 


4 


2 


2 


3 


M00004214C:H05 


11451 


3 


2 


1 


2 


1 


1 


M00004223A:G10 


16918 


4 


0 


0 


0 


0 


0 


M00004223B:D09 


7899 


5 


4 


0 


2 


1 


0 


M00004223D:E04 


12971 


4 


0 


0 


0 


1 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



\ luuc lvalue 




Clones in 


Clones 




ID 


I iht 




M00004229B:F08 


6455 


3 


1 


M00004230B:C07 


7212 


3 


5 


M00004269D:D06 


4905 


7 


6 


M00004275C:C11 


16914 


3 


o 


M00004283B:A04 


14286 


3 


1 


M00004285B:E08 


56020 


1 


o 


M00004295D:F12 


16921 


4 


0 


M00004296C:H07 


13046 


4 


1 


M00004307C:A06 


9457 


2 


0 


M00004312A:G03 


26295 


2 


0 


M00004318C:D10 


21847 


2 


1 


M00004372A:A03 


2030 


13 


10 


M00004377C:F05 


2102 


12 


20 



Clones in 


Clones in 


Clones in 


Clones 








Lid9 






A 
u 


u 


2 






A 
U 


3 


1 


*3 
J 


1 

I 


o 


1 
i 


A 
U 


A 


o 


1 


1 
1 


I 


o 

sj 




A 
U 


f\ 

u 


0 


1 


2 


1 


0 


1 


0 


0 


5 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


32 


4 


0 


0 


23 


21 


6 


5 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


M00001340B:A06 


17062 


0 


0 


M00001340D:F10 


1 1589 


0 


0 


M00001341A:E12 


4443 


0 


0 


M00001342B:E06 


39805 


0 


0 


M00001343C:F10 


2790 


0 


0 


M00001343D-.H07 


23255 


0 


0 


MOO001345A:E01 


6420 


0 


0 


M00001346A:F09 


5007 


0 


0 


M00001346D:E03 


6806 


0 


0 


M00001346D:G06 


5779 


0 


0 


M00001346D:G06 


5779 


0 


0 


M00001347A:B10 


13576 


0 


0 


M00001348B:B04 


16927 


0 


0 


MOO001348B:GO6 


16985 


0 


0 


M00001349B:B08 


3584 


0 


0 


M00001350A:H01 


7187 


0 


0 


M00001351B:A08 


3162 


0 


1 


MOO001351B:AO8 


3162 


0 


1 


M00001352A-.E02 


16245 


0 


0 


M00001353A:G12 


8078 


0 


0 


M00001353D:D10 


14929 


0 


3 


M00001355B:G10 


14391 


0 


0 


M00001357D:D11 


4059 


0 


0 


M00001361A:A05 


4141 


0 


0 


M00001361D:F08 


2379 


0 


0 


M00001362B:D10 


5622 


0 


0 


M00001362C:H11 


945 


0 


0 


M00001365C:C10 


40132 


0 


0 


M00001370A:C09 


6867 


0 


0 


M00001371C:E09 


7172 


0 


0 


M00001376B:G06 


17732 


0 


0 


M00001378B:B02 


39833 


0 


0 


M00001379A:A05 


1334 


0 


0 


M0O001380D:BO9 


39886 


0 


0 


MOO001382C:AO2 


22979 


0 


0 


M0O001383A:CO3 


39648 


0 


0 


MOO0O1383A:CO3 


39648 


0 


0 


M00001386C:B12 


5178 


0 


0 


M00001387A:C05 


2464 


0 


0 


M00001387B:G03 


7587 


0 


0 


MOO001388D:GO5 


5832 


0 


0 


M00001389A:C08 


16269 


0 


1 


M00001394A:F01 


6583 


1 


4 


M0O001395A:CO3 


4016 


0 


0 


M00001396A:C03 


4009 


0 


0 


M00001402A:E08 


39563 


0 


0 


M00001407B:D11 


5556 


0 


0 


M00001409C:D12 


9577 


0 


0 
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Clones in Clones in Clones in Clones in 



Libl7 


Libl8 


Libl9 


Lib20 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


5 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



v-ione ixanie 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones i 




in 


T ih1^ 

JL/1U 1 Z> 


J_JD1©D 


IjIDI / 


T ;ki o 
J^IDlo 


L.1D1V 


JLlDZO 


M00001410A:D07 


7005 


o 


o 




n 


A 


A 
U 


M00001412B:B10 


8551 


o 


o 


o 


0 


A 
U 


A 

yJ 


M00001415A:H06 


13538 


o 


o 


n 
w 


A 
u 


A 

u 


A 

u 


M00001416A:H01 


7674 


o 


o 


n 


A 


A 
U 


A 


M00001416B.H1 1 


8847 


o 


o 


o 

u 


A 


A 

u 


A 


M00001417A:E02 


36393 


o 


o 


n 
u 


A 
U 


A 

u 


A 
U 


M00001418B:F03 


9952 


o 


o 

yj 




A 


A 
U 


A 
U 


M00001418D:B06 


8526 


o 


o 

yj 




A 
V 


A 
U 


A 
U 


M00001421C:F01 


9577 


o 


o 


0 


A 


A 
U 


A 


M00001423B:E07 


15066 


o 




o 


A 


A 


A 
U 


M00001424B:G09 


10470 


o 


o 


0 


A 
U 


A 
U 


A 
U 


M00001425B-H08 


22195 


o 


o 

yj 


n 

u 


A 
U 


A 
U 


A 
U 


M00001426DC08 


4261 


n 

yj 


o 

yj 


1 

I 


A 
U 


A 
U 


1 
1 


M0000 1 428 A H 1 0 


84182 


A 
yj 


A 
u 


U 


U 


U 


u 


M0000 1 429A H04 


2797 




n 

u 


A 
U 


A 
U 


A 

U 


A 
(J 


M00001429B-A1 1 


4635 


o 

yj 


A 


A 
\J 


A 
U 


A 


r\ 
U 


M0000 1 429DD07 


40392 


o 


A 
\J 


A 
U 


A 


A 

u 


A 


M00001439C-F08 


400S4 


a 

yj 


A 


A 
U 


A 


U 


0 


M0000 1 442CD07 


16711 


yj 


a 
u 


A 
U 


A 


U 


0 


M00001445A-F05 

IT1WW 1 T ~ ^ JX m 1 \J*J 


13532 

1 J J J Am 




n 

V 


A 


A 
U 


A 


u 


M0000 1 446AF05 


7801 


o 


u 


A 

u 


A 

u 


A 
U 


0 


M0000 1 447 A G03 


10717 


o 


n 

u 


A 
U 


A 
U 


A 


U 


M0OO0 1 448DC09 


a 

o 


1 
i 


D 


o 


1 
1 


1 /I 

J 4 


J 


M00001448D-H01 

1™AW W J 1 • U 1— X a 1 IV 1 


36313 


o 


-1 
-> 


A 

u 


A 
U 


J 


U 


M0000 1 449 A - A 1 2 


5857 


o 

w 


n 


A 

u 


A 
U 


A 


u 


M0000 1 449 A B 1 2 


41633 




o 

u 


A 
U 


A 
U 


A 
U 


V 


M00001449A:D12 


3681 




0 


A 
U 


A 
U 


A 

u 


A 


M0000 1 449 A G 1 0 


36535 


o 

yj 


n 


A 

u 


A 

u 


A 


U 


MOOO0 1 449C:D06 


861 10 


o 


o 

yj 


u 


A 
U 


A 


A 
U 


M0000 1 450A : A02 


39304 


o 


o 

yj 


A 


A 

u 


A 

U 


A 


M00001450A:A1 1 


32663 


o 


o 


n 

u 


A 
U 


A 

U 


A 
U 


M00001450A:B12 


82498 


o 


f) 


n 

u 


A 
U 


A 
U 


A 

u 


M00001450A:D08 


27250 


o 


o 


o 


A 


A 

u 


A 

u 


M00001452A:B04 


84328 


o 


o 


n 


A 
U 


A 
U 


A 
U 


M00001452A:B12 


86859 


o 


o 


o 


A 


A 

u 


A 
U 


M00001452A:D08 


1120 


o 


o 


o 


A 
U 


A 

\J 


A 
U 


M00001452A.F05 


85064 


o 


o 


o 


A 
KJ 


A 
U 


A 


M00001452C:B06 


16970 


o 


o 


*<- 


A 

yj 


1 
1 


A 
U 


M00001453A:E11 


16130 


o 


o 


o 


o 


A 
U 


A 


M00001453C:F06 


16653 


o 


o 




o 

yj 


A 
yj 


A 
u 


M00001454A:A09 


83103 


o 


o 


o 


A 

u 


A 

u 


A 


M00001454B:C12 


7005 


o 


o 


o 


A 


A 
U 


A 


M00001454D:G03 


689 


0 


2 


2 


o 


4 




M00001455A:E09 


13238 


0 


0 


0 


0 


0 


0 


M00001455B:E12 


13072 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001460A:F06 


2448 


0 


0 


0 


0 


0 


0 


M00001460A:F12 


39498 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



L/lOIlv 1^<IIIIC 


Cluster 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


M00001461A:D06 


1531 


0 


0 


M00001463C:B11 


19 


2 


13 


M00001465A:B1 1 


10145 


0 


0 


M00001466A:E07 


4275 


0 


0 


M00001467A:B07 


38759 


0 


0 


M00001467A:D04 


39508 


0 


0 


M00001467A:D08 


16283 


0 


0 


M00001467A:D08 


16283 


0 


0 


M00001467A:E10 


39442 


0 


0 


M00001468A:F05 


7589 


0 


0 


M00001469A:C10 


12081 


0 


0 


M00001469A:H12 


19105 


0 


0 


M00001470A:B10 


1037 


0 


0 


M00001470A:C04 


39425 


0 


0 


M00001471A:B01 


39478 


0 


0 


MOO0O1481D:AO5 


7985 


0 


0 


M00001490B:C04 


18699 


0 


0 


M00001494D:F06 


7206 


0 


0 


M00001497A:G02 


2623 


0 


0 


M00001499B:A11 


10539 


0 


0 


MOO0O15O0A:CO5 


5336 


0 


0 


M00001500A:E11 


2623 


0 


0 


M00001500C:E04 


9443 


0 


0 


M00001501D:C02 


9685 


0 


0 


MOO0O15O4C:AO7 


10185 


0 


0 


M00001504C:H06 


6974 


0 


0 


M00001504D:G06 


6420 


0 


0 


MOO001507A:H05 


39168 


0 


0 


MO00O1511A:HO6 


39412 


0 


0 


MOO0O1512A:AO9 


39186 


0 


0 


M00001512D:G09 


3956 


0 


0 


MO00O1513A:BO6 


4568 


0 


0 


MO00O1513C:EO8 


14364 


0 


0 


M00001514C:D11 


40044 


0 


] 


M00001517A:B07 


4313 


0 


0 


M00001518C:B11 


8952 


0 


0 


M00001528A:C04 


7337 


0 


0 


M00001528A:F09 


18957 


0 


0 


M00001528B:H04 


8358 


0 


0 


M0O0O1531A:D01 


38085 


0 


0 


M00001532B:A06 


3990 


1 


1 


M00001533A:C11 


2428 


0 


0 


M00001534A:C04 


16921 


0 


0 


M00001534A:D09 


5097 


0 


0 


M00001534A:F09 


5321 


0 


1 


M00001534C:A01 


4119 


0 


0 


M00001535A:B01 


7665 


0 


0 


M00001535A:C06 


20212 


0 


0 


M00001535A:F10 


39423 


0 


0 
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Clones in Clones in Clones in Clones in 



Libl7 


Libl8 


Libl9 


Lib20 


0 


0 


0 


0 


13 


0 


69 


10 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


(^Iappc in 
\_-iuiiva III 


ClrniAC in 
V^JUJlCa III 


vlUIlca in 


^iunt» in 


Clones in 


Clones I 




ID 


LiblS 


Libl6b 


Lib 17 


Libl8 


Lib 19 


Lib20 


M00001536A:B07 


2696 


0 


0 


0 


0 


3 


0 


MOOO01536A:C08 


39392 


0 


0 


0 


0 


0 


o 


M00001537A:F12 


39420 


0 


0 


0 


0 


0 


o 


M00001537B:G07 


3389 


0 


0 


0 


0 


0 


o 


M00001540A:D06 


8286 


0 


0 


0 


o 


0 


o 


M00001541A:D02 


3765 


0 


0 


0 


0 


o 


o 


M00001541A:F07 


22085 


0 


0 


0 


o 


0 


o 


M00001541A:H03 


39174 


0 


0 


0 


0 


0 


o 


M00001542A:A09 


22113 


0 


0 


o 


o 


0 


o 


M00001542A:E06 


39453 


0 


0 


0 


o 


o 


o 


M00001544A:E03 


12170 


0 


0 


o 


o 


o 


o 


M00001544A:G02 


19829 


0 


0 


o 


o 


o 


o 


M00001544B:B07 


6974 


0 


0 


o 


o 


o 


n 


M00001545A:C03 


19255 


0 


o 


o 


o 


o 


o 


M00001545A:D08 


13864 


0 


o 


o 


o 


o 




M00001546A:G11 


1267 


1 


o 


o 


o 


7 


yj 


M00001548A:E10 


5892 


0 


o 


o 


o 


o 


yj 


M00001548A:H09 


1058 


o 


o 


1 


o 


o 


o 


M00001549A:B02 


4015 


o 


o 


o 


o 


o 


yj 


M00001549A:D08 


10944 


o 


o 


o 


o 






M00001549B:F06 


4193 


0 


o 


o 


o 


o 


o 


M00001549C:E06 


16347 


0 


o 


o 


o 


o 


o 


MO00O1550A:AO3 


7239 


0 


o 


o 


o 


o 


o 

yj 


M00001550A:G01 


5175 


0 


0 


o 


o 


o 


o 


M00001551A:B10 


6268 


0 


o 


o 


o 


o 


o 

yj 


M00001551A:F05 


39180 


0 


o 


o 


o 


o 


o 


M00001551A:G06 


22390 


0 


o 


o 


o 


o 


o 


M00001551C:G09 


3266 


o 


o 


] 


o 


o 


o 

yj 


M00001552A:B12 


307 


0 


o 


o 


o 


3 


o 


M00001552A:D11 


39458 


0 


0 


o 


o 


o 


o 


M00001552B:D04 


5708 


0 


1 


o 


o 


o 


o 


MO0001553A:H06 


8298 


0 


0 


0 


o 


o 


o 


M00001553B:F12 


4573 


0 


0 


o 


o 


o 


o 


M00001553D:D10 


22814 


0 


0 


0 


o 


o 


o 


M00001555A:B02 


39539 


0 


0 


0 


0 


o 


o 


M00001555A:C01 


39195 


0 


0 


0 


0 


o 


o 


M00001555D:G10 


4561 


0 


0 


0 


0 


o 


o 


M00001556A:C09 


9244 


0 


0 


0 


0 


o 


o 


M00001556A:F11 


1577 


0 


0 


0 


0 


0 


o 


M00001556A:H01 


15855 


3 


5 


5 


0 


3 


1 


M00001556B:C08 


4386 


1 


2 


0 


0 


o 


o 


M00001556B:G02 


11294 


0 


0 


0 


0 


o 


o 


M00001557A:D02 


7065 


0 


0 


0 


0 


0 


0 


M00001557A:D02 


7065 


0 


0 


0 


0 


0 


0 


M00001557A:F01 


9635 


0 


0 


0 


0 


0 


0 


MO0OO1557A:FO3 


39490 


0 


0 


0 


0 


0 


0 


M00001557B:H10 


5192 


0 


0 


0 


0 


0 


0 


M00001557D:D09 


8761 


0 


0 


0 


0 


0 


0 


M00001558B:H11 


7514 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 1 5-20 



V^HJIIC llalllv 


Cluster 


Clones in 


Clones in 




ID 


LiblS 


Libl6b 


M00001560D:F10 


6558 


0 


0 


M00001561A:C05 


39486 


0 


0 


M00001563B:F06 


102 


22 


38 


M00001564A:B12 


5053 


0 


0 


M00001571C:H06 


5749 


0 


0 


M0O001578B:E04 


23001 


0 


0 


M00001579D:C03 


6539 


0 


0 


MOOOO1583D:A10 


6293 


0 


0 


M00001586C:C05 


4623 


0 


0 


M00001587A:B11 


39380 


0 


0 


M00001594B:H04 


260 


0 


0 


M00001597C:H02 


4837 


0 


0 


M00001597D.C05 


10470 


0 


0 


M00001598A:G03 


16999 


1 


1 


M00001601A:D08 


22794 


0 


0 


M00001604A:B10 


1399 


0 


0 


M00001604A:F05 


39391 


0 


0 


M00001607A:E1 1 


11465 


0 


0 


M00001608A:B03 


7802 


0 


0 


M00001608B:E03 


22155 


0 


0 


M00001614C:F10 


13157 


0 


0 


M00001617C:E02 


17004 


0 


0 


M00001619C:F12 


40314 


0 


0 


M00001621C:C08 


40044 


0 


1 


M00001623D:F10 


13913 


0 


0 


M00001624A:B06 


3277 


0 


0 


M00001624C:F01 


4309 


0 


0 


M00001630B:H09 


5214 


1 


0 


M00001644C:B07 


39171 


0 


0 


M00001645A:C12 


19267 


0 


0 


M00001648C:A01 


4665 


0 


0 


M00001657D:C03 


23201 


0 


0 


M00001657D:F08 


76760 


0 


0 


M00001662C:A09 


23218 


0 


0 


M00001663A:E04 


35702 


0 


0 


M00001669B:F02 


6468 


0 


0 


M00001670C:H02 


14367 


0 


0 


M00001673C:H02 


7015 


0 


0 


M00001675A:C09 


8773 


0 


0 


M00001676B:F05 


11460 


0 


0 


M00001677C:E10 


14627 


0 


1 


M00001677D:A07 


7570 


0 


0 


M00001678D:F12 


4416 


0 


0 


M00001679A:A06 


6660 


0 


0 


M00001679A:F10 


26875 


0 


0 


M00001679B:F01 


6298 


0 


0 


M00001679C:F01 


78091 


0 


0 


M00001679D:D03 


10751 


0 


0 


M00001679D:D03 


10751 


0 


0 
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Clones in Clones in Clones in Clones in 



Libl7 


Lib 18 


Libl9 


Lib20 


0 


0 


0 


0 


0 


0 


0 


0 


65 


7 


43 


10 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


1 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 1 5-20 







v^iones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 




IjIUIDO 


-LflOl / 


I ;ki 6 


JLiDly 


L.ID2U 


MO0O0!680D:FO8 


10539 


o 


o 


o 

yj 


yj 


A 
v! 


A 
U 


M00001682C:B12 


17055 


o 


o 


o 


o 

yj 


A 
U 


A 
U 


M00001686A:E06 


4622 


o 


o 


yJ 




A 

u 


A 

u 


M00001688C:F09 


5382 


o 


o 


n 
\j 


yj 


A 

\J 


A 


MO0O01693C:G01 


4393 


o 


o 


o 

yj 


yj 


A 
V 


A 
U 


MO00O1716D:H05 


67252 


0 


o 


o 

yj 


o 

yj 


A 


A 
U 


MO0O03741D:CO9 


40108 


o 


o 


o 

yj 


A 
yj 


A 
U 


A 
U 


MO0003747D:CO5 


1 1476 


o 


o 


o 

yj 


o 

yJ 


A 
U 


A 
U 


MO0003759B:B09 


697 


o 


o 


o 

yj 


o 

yJ 


1 
I 


A 


M00003762C:B08 


17076 


o 


o 


yj 


A 
U 


A 
U 


A 


M00003763A:F06 


3108 


o 


o 


o 

yj 


A 
V 


A 
U 


A 


M00003774C:A03 


67907 


o 


o 

yj 


n 

V 


A 
U 


A 
U 


A 
U 


MO0003796C:D05 


5619 


o 


o 


n 


A 
V 


A 
U 


A 


M00003826B:A06 


1 1350 


o 


o 


a 

\J 


A 
U 


A 


0 


MO0003833AEO5 


21877 


o 


o 


u 


A 
V 


u 


0 


M00003837D:A01 


7899 


o 


yj 


A 

u 


A 
U 


A 


0 


MO0003839A:DO8 


7798 


o 


0 

yj 




A 
U 


u 


0 


M00003844CB1 1 


6539 


o 


0 
yj 


y) 


A 
U 


u 


0 


M00003846B:DO6 


6874 


o 


o 


1 


A 
U 


A 


V 


M00003851BD10 


13595 


o 


n 
yJ 


A 
V 


A 
U 


U 


0 


M00003853AD04 


5619 


o 


n 

yJ 


A 
U 


A 


U 


0 


M00003853A:F12 


10515 


o 


yj 


A 
U 


A 

u 


U 


0 


MO0O03856B:CO2 


4622 




yj 


A 

u 


A 

u 


U 


0 


MOOO03857A:G10 


3389 


o 


o 

yj 


A 
U 


A 

u 


A 

u 


U 


M00003857A:H03 


4718 


o 


yJ 


A 
U 


A 
V 


u 


0 


M00003871C:E02 


4573 


o 


yJ 


A 
U 


A 

u 


A 


0 


M00003875B:F04 


12977 


o 


o 

yj 


A 
\J 


A 
U 


A 

u 


u 


M00003875B:F04 


12977 


o 


n 
yj 


A 
U 


A 
U 


A 




MO00O3875C:GO7 


8479 


o 


o 

yj 


A 
y) 


A 

yJ 


A 


1 

1 


M00003876D:E12 


7798 


o 


yj 


A 
yJ 


A 

V 


A 

u 


U 


M00003879B:C11 


5345 


o 


o 


A 
yJ 


A 


A 
U 


1 


M00003879B:D10 


31587 


o 


o 


A 
yJ 


A 
U 


A 
U 


A 

u 


M00003879D:A02 


14507 


o 


o 


yJ 


A 
yJ 


A 
U 


A 

u 


MO00O3885C:AO2 


13576 


o 


o 


o 

yj 


A 

u 


A 
U 


A 

u 


MO0003885C:AO2 


13576 


o 


o 


o 

yj 


n 
yj 


A 
U 


A 

u 


M00003906C:E10 


9285 


o 


o 


o 

yj 


A 
yj 


A 


A 

u 


M00003907D:A09 


39809 


o 


o 


o 

yj 


A 


A 

u 


A 
U 


MOO003907D:HO4 


16317 


0 


o 


o 


yj 


A 


A 
U 


MOO003909D:CO3 


8672 


• 0 


o 


o 


o 

yj 


A 

yj 


A 
U 


M00003912B:D01 


12532 


0 


o 


o 

yj 


o 

yj 


A 

u 


A 
U 


M00003914C:FO5 


3900 


0 


o 


o 


o 

yj 


1 
i 


A 
U 


M00003922A:EO6 


23255 


0 


o 


o 


o 

yj 


A 

yJ 


A 

u 


M00003958A:H02 


18957 


0 


0 


0 


0 


o 


o 


M00003958A:H02 


18957 


0 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


0 


0 


0 


MO00O3958C:G10 


40455 


0 


0 


0 


0 


0 


0 


MO0003968B:FO6 


24488 


0 


0 


0 


0 


0 


0 


MO00O3970C:BO9 


40122 


0 


0 


0 


0 


0 


0 


M00003974D:E07 


23210 


0 


0 


0 


0 


0 


0 
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WO 99/33982 
Table 6 All Differential Data for Libs 15-20 



done fName 


Cin«tf r 


1 1\ ri t» c in 

V^ILMIO 111 


Clones in 




ID 


LiblS 


Libl6b 


M00003974D:H02 


23358 


0 


0 


M00003975A:G1 1 


12439 


0 


0 


M00003978B:G05 


5693 


0 


0 


M00003981A:E10 


3430 


0 


0 


M00003982C:C02 


2433 


0 


0 


M00003983A:A05 


9105 


0 


0 


M00004028D:A06 


6124 


0 


0 


M00004028D:C05 


40073 


0 


0 


M00004031A:A12 


9061 


0 


0 


M00004031 A-A12 


9061 


0 


0 


M00004035C:A07 


37285 


0 


0 


M00004035DB06 


17036 


0 


0 


M00004059A:D06 


5417 


0 


0 


M00004068B:A01 


3706 


0 


0 


M00004072B*B05 


17036 


0 


0 


M0000408 1 CD 1 0 


15069 


0 


0 


M0000408 1 CD 1 2 


14391 


0 


0 


M00004086DG06 


9285 


0 


0 


M00004087DA01 


6880 


0 


0 


M00004093DB 12 


5325 


1 


1 


M00004093DB12 


5325 


1 


] 


M00004105CA04 

IVIUV/UV~ 1 W ^ \ ' mil*. v7 ^ 


7221 


0 


0 


M00004 1 08AE06 


4937 


0 


0 


M000041 1 1DA08 


6874 


0 


0 


M000041 14CF1 1 


13183 


0 


0 


M00004 1 38BH02 


13272 


0 


0 


M00004146CC1 1 


5257 


0 


1 


M00004151D-B08 


16977 


0 


0 


M00004157CA09 


6455 


0 


0 


M00004169CC12 


5319 


0 


0 


M00004171D:B03 


4908 


0 


0 


M00004172C:D08 


11494 


0 


0 


M00004183C:D07 


16392 


0 


0 


M00004185C:C03 


11443 


0 


0 


M00004197D:H01 


8210 


0 


0 


M00004203B:C12 


14311 


0 


0 


M00004212B:C07 


2379 


0 


0 


M00004214C:H05 


11451 


0 


0 


M00004223A:G10 


16918 


0 


0 


M00004223B:D09 


7899 


0 


0 


M00004223D:E04 


12971 


0 


0 


M00004229B:F08 


6455 


0 


0 


M00004230B*C07 


7212 


0 


0 


M00004269D:D06 


4905 


0 


0 


M00004275C:C11 


16914 


0 


0 


M00004283B:A04 


14286 


0 


0 


M00004285B:E08 


56020 


0 


0 


M00004295D:F12 


16921 


0 


0 


M00004296C:H07 


13046 


0 


0 
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Clones in Clones in Clones in Clones in 



Libl7 


Libl8 


Libl9 


Lib20 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


1 


0 


1 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Nsnte 


Cluster 


( y 1 nn £>c in 
VlUIICa 111 


r^lnn0C in 
L-IUlICS in 


Clones in 


Clones in 


Clones in 




ID 


LiblS 


Libl6b 


Libl7 


Libl8 


Libl9 


M00004307C:A06 


9457 


0 


0 


0 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


0 


0 


M00004372A:A03 


2030 


0 


0 


0 


0 


0 


M00004377C:F05 


2102 


0 


0 


0 


0 


0 



Clones in 
Lib20 

0 
0 
0 
0 
0 



BNSDOCID: <W0^9933982A2_L> 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


M00001340B:A06 


17062 


M00001340D:F10 


11589 


M00001341A-.E12 


4443 


M00001342B:E06 


39805 


M00001343C:F10 


2790 


M00001343D:H07 


23255 


M00001345A:E01 


6420 


M00001346A:F09 


5007 


MOO0O1346D:E03 


6806 


M00001346D:G06 


5779 


M00001346D:G06 


5779 


M00001347A:B10 


13576 


M00001348B:B04 


16927 


M00001348B:G06 


16985 


M00001349B:B08 


3584 


MO0001350A:H01 


7187 


M00001351B:A08 


3162 


M00001351B:A08 


3162 


M00001352A:E02 


16245 


MO0001353A:G12 


8078 


M00001353D:D10 


14929 


M00001355B:G10 


14391 


MOO0O1357D:Dll 


4059 


M00001361A-.A05 


4141 


M00001361D:F08 


2379 


M00001362B-.D10 


5622 


M00001362C:H11 


945 


M00001365C:C10 


40132 


MO0001370A:C09 


6867 


M00001371C:E09 


7172 


M00001376B:G06 


17732 


M00001378B:B02 


39833 


M00001379A:A05 


1334 


M00001380D:B09 


39886 


M00001382C:A02 


22979 


M00001383A:C03 


39648 


MO0001383A.C03 


39648 


M00001386C:B12 


5178 


M00001387A:C05 


2464 


M00001387B:G03 


7587 


MO0001388D:G05 


5832 


M00001389A:C08 


16269 


M00001394A:F01 


6583 


M00001395A:C03 


4016 


M00001396A:C03 


4009 


M00001402A:E08 


39563 


M00001407B:D1 1 


5556 



lones in 


Clones in 


Clones in 


Libl2 


Libl3 


Libl4 


0 


0 


0 


0 


0 


0 


4 


2 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


1 


2 


1 


0 


0 


0 


0 


2 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


2 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


0 


0 


0 


2 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



v-ioiie iodine 


Cluster jlu 


Clones in 


Clones in 


Clones 






y ikn 




LlDl4 


M00001409CD12 


OS77 


u 


A 
U 


n 
U 


MO000 1 4 1 OA D07 


700 5 


n 
u 


A 
V 


A 


M00001412BB10 


0«/«/ I 


n 
u 


A 
U 


A 


M00001415AH06 


I jJjO 


u 


A 
V 


A 
0 


M00001416AH01 


7674 


u 


A 

u 


A 

0 


i ▼! v/vvu 1 " 1 UJLJ .1111 


8847 


1 
1 


A 
U 


A 


M00001417AF09 


J OJyj 


n 
u 


A 


A 
U 


M00001418RF01 


0057 


u 


A 
U 


A 


M00001418DB06 


8596 


u 


A 

u 


A 
U 


MOOO0 1 49 1CF01 


0577 


A 

u 


A 


A 

u 


MOOOO 1493RF07 

iv i\J\J\j\j i nx.j jj . J_/V/ / 


1 JUDO 




U 


A 

0 


MO000 1 494R-O09 


1 047fi 


A 

u 


u 


rv 

0 


MOOO0 1 49 5RH0R 

JVll/UvU 1 ti. J JL> .1 1UO 


79 1 O^n 

ZZ 1 7J 


A 

u 


u 


A 

0 




4.961 
HZO 1 


A 


r\ 
U 


0 


M00001 498AH1 0 


%A 1 »9 


U 


0 


0 


MO0O0 1 470 A H04 


97Q7 


U 


rv 
U 


0 


MOOOO 1 49QR- A 1 1 




U 


0 


0 


moooo 1 49or>nn7 




U 


0 


0 


M00n0141QP-Fn» 




U 




0 




1 671 1 
lO/J 1 


i\ 
V 


0 


0 


MOOO0 1445 A -F05 


1 1^^O 
1 3J JZ 


A 
U 


A 


0 


M00001 446 A F05 
ivi v/v/ w 1 't'tu/a.. r uj 


78H1 
/ oU 1 


A 
U 


1 

I 


0 


MO0O0 1 447 A riOl 

IVI\J\J\J\J I *+*t / A.VJUj 


1 H71 7 
I U / 1 / 


A 
U 


A 
0 


0 


MOOOO 1 4480*^00 


o 
o 


/ 


6 


9 


MOOO0 1 448 rv H0 1 


161 1 1 


I 


A 


0 


MOOO0 1440 A- A 1 7 

IVI\J\J\J\J 1 *tH7A.A 1 Z 


^R^7 
DoZ) / 


A 


A 


0 


MOOOO 1 440 A -R 1 7 


41 611 


r\ 
U 


A 


0 


M00001440AO19 


1651 1 


I 


A 


A 


MOOOO 1 440 A -ri 1 0 


16515 


A 


A 

0 


0 


MOOO0 1 440C" • H06 


86 1 1 ft 


A 


A 


A 

0 


MOOO0 1450 A -A09 


10104 


A 

U 


1 
1 


A 

0 


MOOO0 1 450 A All 


19661 


A 

U 


A 


A 

0 


MOOO0 1 450 A -RI? 

JViV/WV 1 " «y Vy ./». . U 1 ^ 




A 

U 


A 


A 


MOOOO 1 450 AD08 


97950 


A 
U 


A 
U 


A 

u 


MOOOO 1 452AB04 


84^98 


A 
U 


A 
U 


A 

u 


M00001452AB12 


86850 

OOOJ7 


A 
U 


A 

u 


A 

u 


MOOOO 1452AD08 


1 190 


A 
U 


A 

u 


A 

u 


MOOOO 1 452 AF05 


85064 


A 
U 


A 

u 


A 


M00001452CB06 


16070 

1U7 / U 


1 
1 


A 

u 


A 

u 


M00001453AE1 1 


16130 


A 
V 


A 

u 


A 


M00001453CF06 

"I W W I ^TmJmJ ^m* • 1 Vy Vy 


16653 


A 
V 


A 
U 


u 


M00001454A:A09 


83103 


o 

vy 


A 
w 


A 

u 


M00001454B:C12 


7005 


0 


0 


0 


MOOOO 1454D:G03 


689 


0 


0 


l 


M00001455A:E09 


13238 


0 


0 


0 


M00001455B:E12 


13072 


0 


0 


0 


MOOOO 1455D:F09 


9283 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001460A:F06 


2448 


0 


0 


0 


M00001460A:F12 


39498 


0 


0 


0 


M00001461A:D06 


1531 


0 


0 


1 


M00001463C:B11 


19 


17 


32 


31 


M00001465A:B11 


10145 


0 


0 


0 


M00001466A:E07 


4275 


0 


0 


0 


M00001467A:B07 


38759 


0 


0 


0 


M00001467A:D04 


39508 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


M00001467A:E10 


39442 


0 


0 


0 


M00001468A:F05 


7589 


0 


0 


0 


M00001469A:C10 


12081 


0 


0 


0 


M00001469A:H12 


19105 


0 


0 


0 


M00001470A:B10 


1037 


0 


0 


0 


M00001470A:C04 


39425 


0 


0 


0 


M00001471A:B01 


39478 


0 


0 


0 


M00001481D:A05 


7985 


0 


0 


0 


M00001490B:C04 


18699 


0 


0 


0 


M00001494D:F06 


7206 


0 


0 


0 


M00001497A:G02 


2623 


1 


0 


0 


M00001499B:A11 


10539 


0 


1 


0 


M0O0O1500A:CO5 


5336 


0 


0 


0 


M00001500A:E11 


2623 


1 


0 


0 


M00001500C:E04 


9443 


0 


0 


0 


M0O0O1501D:CO2 


9685 


0 


0 


0 


M00001504C:A07 


10185 


0 


0 


0 


M00001504C:H06 


6974 


0 


0 


0 


M00001504D:G06 
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0 


0 


0 
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39168 


0 


0 


0 
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0 


0 


0 
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0 


0 


0 
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3956 


0 


0 


0 


M00001513A:B06 


4568 


0 


0 


0 


M00001513C:E08 


14364 


0 


0 


0 


M00001514C:D11 


40044 


0 


0 


0 


M00001517A:B07 


4313 


0 


0 


0 


M00001518C:B11 


8952 


0 


0 


0 


M00001528A:C04 


7337 


1 


2 


2 


M00001528A:F09 


18957 


0 


0 


0 


MO00O1528B:H04 


8358 


0 


0 


0 


M00001531A:D01 


38085 


0 


0 


0 


M00001532B:A06 


3990 


0 


0 


0 


M00001533A:C11 


2428 


0 


0 


0 


M00001534A:C04 


16921 


0 


0 


0 


MO0001534A:D09 


5097 


0 


0 


0 


M00001534A:F09 


5321 


4 


7 


6 


M00001534C:A01 


4119 


0 


0 


0 
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9 
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0 
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0 
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u 
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1 
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U 
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-> 


A 

U 
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U 
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U 
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U 
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U 
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U 
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A 

u 
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U 
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U 
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/ ZJ7 
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U 
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1 
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U 
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^ 1 7S 
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1 


A 
U 


A 

U 
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A 
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1 
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A 

w 
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U 


A 
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A 
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U 
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1 
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U 
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*f 
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\j 


0 

yj 


A 
O 


MOOOOI 5 52RD04 


S708 


0 

v/ 


0 

yj 


A 
U 


M0000 1 5 53 A H06 


8208 
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0 

yj 


0 

yj 


A 
U 


M00001553BF12 


4S73 


n 


0 

yj 


A 

yj 


M00001553DD10 


22814 
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0 
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O 
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U 
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4S61 


0 
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yj 
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yj 
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0 
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1 


0 

yj 


MOOOOI S56AF1 1 


1 S77 




A 

yj 


9 


MOOOO 1556A:H01 


15855 


1 


1 


0 


MOOOO 1556B:C08 


4386 


3 


0 


1 


MOOOOI 556B:G02 


11294 


0 


0 


0 


MOOOO 1557A:D02 


7065 


0 


0 


0 


MOOOOI 557A:D02 


7065 


0 


0 


0 


MOOOOI 557A:F01 


9635 


0 


0 


0 
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plnnp No IY1P 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Lib 14 


MO0O0I557AFO3 


39490 


0 


0 


0 


M00001557B:H10 


5192 


0 


0 


0 


M00001557D:D09 


8761 


0 


0 


0 


M00001558B:H11 


7514 


0 


0 


0 


M00001560D:F10 


6558 


0 


0 


0 


M00001561 A:C05 


39486 


0 


0 


0 


M00001563B:F06 


102 


2 


1 


2 


M00001564A:B12 


5053 


0 


0 


0 


M00001571C:H06 


5749 


0 


0 


0 


M00001578B:E04 


23001 


0 


0 


0 


MO0O01579D:C03 


6539 


0 


0 


0 


MO0O01583D:A10 


6293 


0 


0 


0 


M00O01586C:CO5 


4623 


0 


0 


0 


M00001587A:B1 1 


39380 


0 


0 


0 
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1 


0 


0 
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1 


0 


0 
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10470 


0 


0 


0 


M00001 598AG03 
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4 


2 
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0 


0 
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0 


0 
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0 


0 
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23218 


0 


0 
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0 


0 


0 
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0 


0 


0 


M00001670C:H02 
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0 


0 


0 
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0 


0 


0 


M00001675A:C09 
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0 


0 


0 
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2 


0 


0 
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0 


0 


0 


M00001677D:A07 
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0 


0 


0 
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1 


2 


0 
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0 


0 


0 
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Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones 






Libl2 


Libl3 


UbU 


M00003958A:H02 


18957 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


M00003968B:F06 


24488 


0 


0 


0 


M00003970C:B09 


40122 


0 


0 


0 


M00003974D:E07 


23210 


0 


0 


0 


M00003974D:H02 


23358 


0 


0 


0 


M00003975A:G11 


12439 


0 


0 


0 


M00003978B:G05 


5693 


0 


0 


0 


M00003981A:E10 


3430 


0 


0 


0 


M00003982C:C02 


2433 


2 


4 


0 


M00003983A:A05 


9105 


0 


0 


0 


M00004028D:A06 


6124 


0 


0 


0 


M00004028D:C05 


40073 


0 


1 


0 


M00004031A:A12 


9061 


0 


0 


0 


M00004031A:A12 


9061 


0 


0 


0 


M00004035C:A07 


37285 


0 


0 


0 


M00004035D:B06 


17036 


0 


0 


0 


M00004059A:D06 


5417 


0 


0 


0 


M00004068B:A01 


3706 


0 


0 


0 


M00004072B:B05 


17036 


0 


0 


0 


M00004081C:D10 


15069 


0 


0 


0 


M00004081C:D12 


14391 


0 


0 


0 


M00004086D:G06 


9285 


0 


0 


0 


M00004087D:A01 


6880 


0 


0 


0 


M00004093D:B12 


5325 


0 


0 


0 


M00004093D:B12 


5325 


0 


0 


0 


M00004105C:A04 


7221 


0 


0 


0 


M00004108A:E06 


4937 


0 


0 


0 


M00004111D:A08 


6874 


0 


0 


0 


M00004114C:F11 


13183 


0 


0 


0 


M00004138B:H02 


13272 


0 


0 


0 


M00004146C:C11 


5257 


0 


0 


1 


M00004151D:B08 


16977 


0 


0 


0 


M00004157C:A09 


6455 


0 


0 


0 


M00004169C:C12 


5319 


0 


0 


0 


M00004171D:B03 


4908 


0 


0 


0 


M00004172C:D08 


11494 


0 


0 


0 


M00004183C:D07 


16392 


0 


0 


0 


M00004185C:C03 


11443 


2 


0 


0 


M00004197D:H01 


8210 


0 


0 


0 


M00004203B:C12 


14311 


0 


0 


0 


M00004212B:C07 


2379 


0 


0 


0 


M00004214C:H05 


11451 


0 


0 


0 


M00004223A:G10 


16918 


0 


0 


0 


M00004223B:D09 
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0 


0 


0 


M00004223D:E04 
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0 


0 


0 


M00004229B:F08 
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0 


0 


0 
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T ih1 1 


T ihl % 




M00004230BC07 


7212 
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4905 
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o 


n 
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o 


0 


o 
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o 


o 


o 

u 
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o 




n 

V 


M00004295D:F12 


16921 


o 


o 


0 


M00004296C:H07 


13046 


0 


0 


0 


M00004307C:A06 


9457 


1 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


M00004372A:A03 


2030 


0 


0 


0 


M00004377C:F05 


2102 


0 


0 
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We Claim: 



10 



15 



20 



25 



1. A library of polynucleotides, the library comprising the sequence information of 
at least one of SEQ ID NOS: 1-844. 

2. The library of claim 1, wherein the library is provided on a nucleic acid array. 

3. The library of claim 1, wherein the library is provided in a computer-readable 

format. 

4. The library of claim 1, wherein the library comprises a differentially expressed 
polynucleotide comprising a sequence selected from the group consisting of SEQ ID NOS:9, 
39, 42, 52, 62, 74, 119, 172, 317, and 379. 

5. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human breast cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 
74, 81, 1 14, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 
379, 384, 386, and 388. 

6. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human colon cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 
176, 241, 288, 317, 357, 362, and 374. 

7. The library of claim 1 , wherein the library comprises a polynucleotide 
differentially expressed in a human lung cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 1 19, 
135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

8. An isolated polynucleotide comprising a nucleotide sequence having at least 90% 
sequence identity to an identifying sequence of SEQ ID NOS: 1-844 or a degenerate variant 



thereof. 
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9. An isolated polynucleotide according to claim 8, wherein the polynucleotide 
comprises a seqeuence encoding a polypeptide of a protein family selected from the group 
consisting of: 4 transmembrane segments integral membrane proteins, 7 transmembrane 

5 receptors, ATPases associated with various cellular activities (AAA), eukaryotic aspartyl 
proteases, GATA family of transcription factors, G-protein alpha subunit, phorbol 
esters/diacylglycerol binding proteins, protein kinase, protein phosphatase 2C, protein 
tyrosine phosphatase, trypsin, wnt family of developmental signaling proteins, and 
WW/rsp5/WWP domain containing proteins. 

10 

1 0. The polynucleotide of claim 9, wherein the polynucleotide comprises a 
sequence ofoneofSEQ IDNOS:24,41, 101, 157, 291, 305, 315, 341, 63, 1 16, 134, 136, 
151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 382, 169, 23, 291, 324, 330, 
341,353, 188, 379, and 395. 

15 

1 1 . The polynucleotide of claim 8, wherein the polynucleotide comprises a 
seqeuence encoding a polypeptide having a functional domain selected from the group 
consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 
bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 

20 zinc finger (CCHC class), and zinc-binding metalloprotease domain. 

12. The polynucleotide of claim 11, wherein the polynucleotide comprises a 
sequence ofoneofSEQ ID NOS: 116,251,374, 97, 136,242,379, 306, 386, 18, 335,61, 
306, 386, 322, 306, and 395. 

25 

1 3. A recombinant host cell containing the polynucleotide of claim 8. 

14. An isolated polypeptide encoded by the polynucleotide of claim 8. 
30 1 5. An antibody that specifically binds a polypeptide of claim 1 4. 

16. A vector comprising the polynucleotide of claim 8. 
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17. A polynucleotide comprising the nucleotide sequence of an insert contained in a 
clone deposited as ATCC accession number xx, xx, xx, xx, xx, xx, xx, xx, or xx. 

18. A method of detecting differentially expressed genes correlated with a cancerous 
5 state of a mammalian cell, the method comprising the step of: 

detecting at least one differentially expressed gene product in a test sample derived 
from a cell suspected of being cancerous, where the gene product is encoded by a gene 
corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 65, 66, 68, 
74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214,219, 223,258, 298,317,338, 
10 379, 384, 386, 388, 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 288, 317, 357, 362, 374, 9, 34, 42, 
62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349,361,369,371,379, 395, 381, and 400; 

wherein detection of the differentially expressed gene product is correlated with a 
cancerous state of the cell from which the test sample was derived. 

15 19. The method of claim 1 8, wherein said detecting step is by hybridization of the 

test sample to a reference array, wherein the reference array comprises an identifying 
sequence of at least one of SEQ ID NOS:l-844. 

20. The method of claim 18, wherein the cell is a breast tissue derived cell, and the 
20 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 



162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388. 

21 . The method of claim 1 8, wherein the cell is a colon tissue derived cell, and the 
25 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 

least one of SEQ IDNOS: 1,39, 52, 97, 119, 134, 172, 176, 241,288,317, 357, 362, and 
374. 

22. The method of claim 18, wherein the cell is a lung tissue derived cell, and the 
30 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 

least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 379, 395, 381, and 400. 



least one of SEQ IDNOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123, 144, 130, 157, 
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SEQUENCE LISTING 



<110> Lewis T. Williams 
Jaime Escobedo 
Michael A. Innis 
Pablo Dominiguez Garcia 
Julie Sudduth-Klinger 
Christoph Reinhard 
Klause Giese 
Filippo Randazzo 
Giulia C. Kennedy 
David Pot 
Altaf Kassan 
George Lamson 
Radoje Drmanac 
Radomir Crkvenjakov 
Mark Dickson 
Snezana Drmanac 
Ivan Labat 
Dena Leshkowitz 
David Kita 
Veronica Garcia 
William Lee Jones 
Birjit Stache-Crain 

<12 0> Novel Human Genes and Gene Expression 
Products I 

<130> 2300-1480P 

<140> 09/ 

<141> 1998-12-21 

<150> 60/068, 755 
<151> 1997-12-23 

<150> 60/080,664 
<151> 1998-04-03 

<150> 60/105,234 
<151> 1998-10-21 

<160> 844 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 1 



tctcccctga gctgcaggcc tgcatatcca gtaggtctac 
tgnggaggaa cctctggctt gctcattaag tcctactgat 
ccccacttat ttttgtcttt cactatcgca ggccttagaa 
cttacctagt ccagtctacc ccctggagtt agaatggcca 
tcacattact cccttcagtg atttcttgta gaagtgccaa 



tggacatctg tactggttgt 
tttcactatc ccctgaattt 
gaggtctacc tgcctccagt 
tcctgaagtg aaaagtaatg 
tccctgaatg ccaccaagat 



60 
120 
180 
240 
300 



<210> 2 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 2 

cccagctgct caggaggctg aggcaggaga attgcttgaa cccaagaggc ggaggttgtg 6 0 

gtgagccgag attgcacctt tgtactccag cctgggcaac gagcaaaaaa ctctgtctca 12 0 

aacaaaaaga agaaaaaaaa aaaaaaaann nnnnnnnnnn nnnnnnnnnn nnnttnttct 18 0 

ggcgncnagt cccaaantcn taccttgtaa gacctttann tnncctgngg tntttntnna 24 0 

cncttanata nnnntntttn ctatcaanta tagggagant tttcntttng gggcaactt 299 

<210> 3 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 3 

atacgattcg aacnnggaca agacgagtat ggaataatat cccactnnnt ttacaatact 6 0 

ganattatgc ngngatagng cttgttccat tcnaccagcg aatnatgcat tnacncnaca 12 0 

cnngagttac tatccaaaca cacgttttca cgntacctga ngctggtnga naattatgcg 180 
accatgaggc tttccangat ntttctannt ancagacngn gnacaatgnt gaanaagcng 24 0 

tacacaccgc nctngncnnc cnncactgan cangtnacnn ngctcactgn ngcctctttc 300 

<210> 4 
<211> 287 
<212> DNA 

<213> Homo sapiens 
<400> 4 

aaancngcac gangccacgt ncgnnnngnt nntactnnnc natngccncn tcantggcng 6 0 

ncagctagac gcctaacagc cgangancca nccntnntgt ganccngtcn tgacngnnag 12 0 

cntgccggtc ttgctcnttt tgtctaccnn gagganannn ntntgggaca tcccagactg 18 0 

agtgaggaga tctgcngctg cnnctgtact tggttacanc ncacacgang actntncctt 24 0 

ggactanana cactagccta anattcngca ctacctantc ctctggc 2 87 

<210> 5 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 5 

gtccttttga accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga 60 
ctgttctact ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag 120 
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gcccggggcg accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc 180 
gaatttcgac ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc 240 
caacccgaga tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc 300 

<210> 6 

<211> 284 

<212> DNA 

<213> Homo sapiens 

<400> 6 

tntccccttt gacgccttan tgccctnncg ctacnngtcc nttaggcctt atcccatcgn 60 

ccntcgtttn gcattctgcc nnagantgac tttncnatca tgcntnatnn gtnnttacna 12 0 

ggggctnggg tgaattntta caccctgcna ntccatanca cantgccntg cnagctncac 180 

cctcntgaat aaatgcaata aantttcngt tgatcttata caccttatgc nccntantta 24 0 

atcagccctn tnttacnana tcnanttatg cnggtattaa aaca 284 

<210> 7 

<211> 277 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gtgctgcaga caacacacct tcctgatgga ggtgtccggc tgatggagaa gtctgtgggc 6 0 

ttgtaaatca tctttgatgt taaccaggcc gacgctgtgg ccacattccg aaagattaac 12 0 

cctgtcaaac cctannnnnn nnnnnnnnnn nnnggatttg atnagcctgt nccanacctc 180 

tgcagcctcn ancggtngtn ntaccatagt ggggatgacc ctctgatact ttgncctggt 240 

ngancatgnt gacanntgct tctacagctt nngggac 277 

<210> 8 

<211> 292 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cttgggaggc tgagtcagga gaattgcttg agcccaggag atggaggttg cagtgagcca 6 0 

agatcatgcc actgcactcc agactgggca acagagggag actccgtctc aaaaactaaa 120 

aaaaaaaata catttagtat agcggggggt gggcgggaga aataatgtta tttcctatgc 180 

aaatgacgnn nnnnnnnnnn cccatggtaa atgtnaatat actgcgtctn ttttgggana 24 0 

gccttttant aaangagtct tanatgaatc tctanntnat gantttaact tg 2 92 

<210> 9 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa gaaacttcaa 60 

gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg tcttctaaat 120 

ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag agtacggtgc 180 

cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat atatcacctt 240 

ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg accaagtgtt 3 00 
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<210> 10 
<2X1> 300 
<212> DNA 

<213> Homo sapiens 



<400> 10 

aggaggcgga gcttgcagtg agctgagatc gcgccactgc actccagcct gggcaacaga 6 0 

gtgagactct gtctcaaaaa aaaaaaaaaa nnnnnnnnnn nnnnnnnnaa nctcgtnttn 120 

gnaaggaaan ggggnaangg accggtntta tncctatgtn gtntttgcag gcaaangaaa 180 

nggacccntt tttgtaaaaa aaagtctttt gnncaantaa acggggtntg ngggtncagg 24 0 

ccctggnggg gcncncantt gcctggnggc ttntgnnaaa tcggnaaagg gaggaaaggc 300 



<210> 11 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 11 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 60 

ggcggtttca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 12 0 

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nncgggttct cccaaattnt 180 

ttnnaggggn ccatggncaa ctgnttnacn tttgtttngg naaccccntg cccnaagncg 24 0 

cananaggct gtnnttnncc ttgttnccaa ggntgaggan caaaaagtac cctntgtttt 3 00 



<210> 12 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 12 

caaagatggt cgtattacta aaggtgaata accagcgcgg ggggcacgtg gagtcactgg 60 

aacatttgtg caatgcfcggt gggaatgtca acccgtgcgg ccctctggaa taagcctggc 12 0 

agctcctcca agagttaccg tgtgacccag caattccact cctagctcca cccacaggaa 180 

ttgaaagcaa agacgcaaac agatgcctgt gcaccaaagt tcacggcagc atccttcgcc 240 

atagtggcag catccgtcgt cacagcggca tcatccttca tcatagcggc agcatccgtc 300 



<210> 13 

<211> 278 

<212> DNA 

<213> Homo sapiens 



<400> 13 

cctgcagcca ctaatgcatt gtgtatgata acaaaaactc tggtatgaca cattttctgt 60 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 12 0 

ggggaggtgt gggaggtttt nncgnnannn nnngcnnnnn annccccggn nngnnngaag 18 0 

ctgnnnnttn naannngcnn nnnannatga naannnccnn ngactggnnn nangaggcct 24 0 

ancccntgnt ttananaaac nnncnncagn ntctctca 278 



<210> 14 
<211> 300 
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<212> DNA 

<213> Homo sapiens 

<400> 14 

gtgtcttcat cttacccagt ggaacctaag aaattaaatt ctccagaaga aactgctttt 60 

cagacaccaa aatctagcca gatgcctcgg ccttcagtgc caccattagt taaaacatca 120 

ctgttttctt caaaattatc tacacctgat gttgtgagcc catttgggac cccatttggc 180 

tctagtgtaa tgaatcggat ggctggaatt tttgatgtaa acacctgcta tgggtcaccg 240 

caaagtcctc agctaataag aagggggcca agattgtgga catcagcttc tgatcagcaa 300 

<210> 15 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 15 

gttatattaa attattcttt gtttttcttt ttcttttaat aaagcctgca agttactaaa 60 

ttgtagtttc ataaattctg tagtaaagta tcatcttggc agtgtgccaa aggtgaaaat 120 

gatgctttct ctaacagaga aattcttagt gactccagtc gtagaaaaac gtctttacaa 180 

cctgaataag attgaagaat tgtgaacata ccatggccta ttggatgaat catttgccgt 24 0 

aggctaaatc agactgtagg gtttgcgatg gatttatgga gtatgtgggt atagaaatca 300 

<210> 16 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<400> 16 

gtttcattta agaagaatga gctagataaa tgtgctcttc tggttacccc accctgacag 6 0 

agtgcatttt tacacggcta gcaggggttg agactgcagc ctggcctnnn nnnnnnnnnn 120 

ngnnnnnngc nnacttnact tcccngaanc actataattg gnanacnttn ctaannggtn 180 

atctngccga cctgnnagat anactcnnga taaaanccnn tgcagaaagc gcccttccat 24 0 

gtcangcncn tnaganacnn ncntaccncc tangna 2 76 

<210> 17 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 17 

ggtgcccatc accacaccca gctaactttt gtatttttag tagagacggg gtttcaccat 60 

gttggccagg ctggtcttga actcctgacc tcgtgatccg cccgccttgg ccccgcaaag 120 

tgctgggatt acaagcatga gcccagcgcc tggctgtatc tttcatttta cccaagtcac 18 0 

tttacccaag taagtaatta ggggaaagcc tgagtcttgt accacctgtt catttgggga 24 0 

actgtgggaa acggagccaa cggacctaag tgccctttga cagtgagttt cataccattt 300 

<210> 18 

<211> 273 

<212> DNA 

<213> Homo sapiens 
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<400> 18 



ctcagctgag gcaattaaac 
gtacagaagt tgggggactg 
ataccagtcc ttttacagat 
tgtagagaaa ttgcagttca 
agatgtcaag caagtactgg 



tggaaaagaa atagattgaa 
aaggagaggg agccactgca 
ataatagata cagcttctga 
gaactggagc atgcagttag 
aaaatgctga act 



aagatactac 
ggtgctagct 
ggtggagggt 
gcaagaggca 



agaagaagca 
gcttaagggg 
gataggagtg 
tcccatgtga 



60 
120 
180 
240 
273 



<210> 19 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagttaaggg 6 0 

tcttattaaa agccaccact ttgctgaggc ctgtacaggc cttgggggtt tggggaagag 12 0 

aaataaggca ggcacttgtc ccttcaggga gggacttgtc cctcactggg aggtttgggg 18 0 

ttgaccttgg ctccagcaga gatacccagc ctggcgtgga aggggcaggt ctgagcttac 24 0 

gcttgactgc agggcaagct gcaggcctct tctgccttcc cctgcattca ccaaggacag 300 

<210> 20 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 20 

atggcatgca ctgacctctt cttggagccc agaactttat agagttgcct accagggtta 60 

ctgtaatgga atttatgatc ttaagaaatt actagttgta ttatttatcc tatgattcat 120 

tcattcaata agcttttact gcataaactt tacatccagc actgtagtta agtacccaaa 18 0 

attgaataga aataatggct tttgaaaatc gcacaaagca ggccaggcac ggtggctcac 24 0 

gcctgtaatc ccagcatttt gggaggccga ggcaggcgga tcacgaggtc aagagatcca 3 00 

<210> 21 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<400> 21 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 6 0 

ggcggttgca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 12 0 

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnccttng gncgggttnt cccaaattnt 18 0 

tttgaggngn ccatggncaa ctgcttnanc tttgttttgg caaccccntg cccnaagtcg 24 0 

catataggct gtncttcacc ttgtttccaa ggctgnggaa canaaagtaa cct 2 93 

<210> 22 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 22 

ctggtctcga acacctgacc tcaggtgatc cattcgtctt ggcctctcga agtgctggga 6 0 

ttccaggcgt gagccactgc ggccagcaca tttccacttt tagatcctac tccataccac 120 



6 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 




PCT/US98/27610 



aggtttcatt taagaagaaa gagctagata aatgtgctct tctggttacc ccaccctgac 
agagtgcatt tttacacggc tagcaggggt tgagactgca gcctggcctg ccagccattg 
gaggtgttta aggaagggca gataatgtga ctctttgcgg ggtgccatct gcttacccat 



180 
240 
300 



<210> 23 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 23 

gaaccaaaga cgtgtatgga gtgttctctt gtccttatcg acttgctctg ctcccagctt 60 
tccaagcgac cggatctgag tgatgcttct agaacatttg ggtgttgggg ggttcccaat 12 0 

agtagaaagg gtccccattc ctgctcagca ccgcacctct ctaccccccc acagacacac 180 
atgcagacac acacatgcag acaacacgca gacacacaca tgcaggcact cacatgcagg 240 
cccatgcaca cacacgtgca cacacatgca gagacatgca gacacgcagg cacacatgca 300 

<210> 24 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 24 

cctcccacaa cacgtgggaa ttcaagatga gatttggttg gggacacagc caacccatat 6 0 

cacccatgcc tggatgccct tctcatgctt gggttctgtc atctgcacca ggccttctgc 120 
tgcccgtctg tcttacccac caggactctg actctccacg ctgggccacc tctcttctcc 180 
aacactgcta tggattgaat gtttatgtta tccccaaatt tgcatgttgc aatcccaatc 240 
tccaatgcca tagtattagg aggtgggggc ctttgggagg tgatttggtc atgaaggtgg 3 00 

<210> 25 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 25 

ggaaaatgaa atctgactat ctgctagttg ccaaaaccca gaaacattcc tgtgtaatgg 6 0 

ttagttggga aagaaggcag cacttgaaaa aatttaccag gttcctcact gggagatgtg 12 0 

ggaaggggcg tgggacgcac gcggtcactc cctctcagcc ccccacattt ctagaacaca 180 
ctgtagctgt gcctctacag actcccgctg cctggcctcc acagatcctg ctcagattca 24 0 

ccagtaggca aagcttggcc ctattagctt tttctctcca tggctctgtg ggaatgtgcg 300 

<210> 26 
<211> 300 
<212> DNA 

<213 > Homo sapiens 
<400> 26 

ctgcagtgag attctctgca atgactggcc tcagcaaggg ggcagcttag gaccctgaca 6 0 

tcccaggtca ctaagccaca taggataagt aatgggtgga cagaagcggg aaaggagaag 12 0 

ggcagggcac atgtttaaaa cttgaacttt ctgaggctaa gactggaaaa ggaatggttt 18 0 

cagctgatat atttggatac cagttgacta tttttaggaa aaaaacacaa atggctttta 24 0 
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aacatcacag tgtgatacag tctaactcag aattagagac aggcaaaaca gaactccatc 300 

<210> 27 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gtactgcttc tgtggctctt cacagacctc acggatgtga ccggagatga 
ccacgtttta aaggagaaag agagctcctg gtggggccct cggggtggtc 
tttgcagtct gcaacagtga cgcgcagccc ggtccggagc gtggtgagct 
tctgggtcag ctttcgctgt gtctcctgtg tgtgttagaa tccagagccc 
caagcgggtc ctccgccaac ggggagagcc tcttcgcggc gctgttggcg 

<210> 28 
<211> 298 
<212> DNA 
<213> Homo sapiens 

<400> 28 

aangnaannn nggggngttg antcnacctn ngaaccgtgt 
antaganntt gggcagganc agagngaggc caagntacgg 
tggggnnnnn nnangnnaan tnnngaaggg gngngannga 
ngagggcgaa ngncaggann nagaaaannn ggggacgana 
gaannnngga gnnnnnanag atgncgggca gngncngngn 

<210> 29 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cctcagcccc acaccagctc tatttcaggg gtgagagtca 
ttcatgggat ttcgattcga agatcctaga ccagggagac 
caaaatacta ggtaagtcac tgcagaccga cctccctgca 
ttgtggagaa tcagagcatc ttgacatgac tgctgaccta 
gggatcctgt ggaacctctt ctagttcagg ggtgtgagca 

<210> 30 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 30 

gtttgtttcc ccgagatgtg aacttgctga aggaaaacag 
agagaactgt cagctcttca ggatgtgaag gcaagaggaa 
gcatgttggt taactgccct gcctactaca gtgtgtctgc 
acaaaatcaa agagatgcca nnnnnnnnnn nntgaggaag 
gaaaagaagg ctgatctcat tggaagtctc acccacaagc 



gtgccgatga 6 0 

tcaggtccca 12 0 

ttgtttgcct 180 

agaggaag t g 24 0 

acagcacgct 300 



anaaacccat ggaaacagct 6 0 

gggaggcnag gagcngagan 12 0 

gggggggana naagggggga 18 0 

nggngaacag ggnnnaaacg 24 0 

aggnganann ngagacgg 2 98 



gagagcactg caatatgtgc 6 0 

actgtgagcc agggatacaa 12 0 

gtttgggaaa gaagctgggt 180 

aagatccctg gcattggcca 24 0 

ttagactgcc agttgtctag 3 00 



tgtaaagagg 


aaggccatac 


60 


tgaagacaag 


gaagcagtga 


120 


tcccaaggct 


gagctactga 


180 


aggaacaggc 


anatgtcaat 


240 


tggagaccct 


ccaggaggcg 


300 



<210> 31 
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<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 31 



tttaaactga gctccaaatg acgttcaaac acccctctcg ggtagagttt tcatggtgga 
acggttgcgc ccaccaaaca gaagcttatg tttttggcac agaaggcctg ggccattttc 
atggacacct ggctggacct cggtggaagt gaactccgta ggttgttgcg ttcactgcag 
cacctcacat gataccgtcc cctctcatgg aacggagcct cccccatgca gcccccactc 
aaatggagtt ttaaaggctg ggttcaggtt acgggggcgt ttctcaccgt ctgaatgcgg 



60 
120 
180 
240 
300 



<210> 32 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 32 

gtgaaacaga aagtggagat gctttccttg acctgaagaa gcctcctgcc tccaaatgcc 6 0 

cccatcgcta tacaaaagaa gaactcttgg atataaaaga actcccccat tccaaacaga 12 0 

ggccttcatg cctttctgaa aaatatgaca gtgatggtgt ctgggaccct gagaagtggc 18 0 

atgcctctct ctacccagct tcagggcgga gctcaccagt ggaaagtctg aagaaagagt 24 0 

tggatacaga ccggccttcc ctggtgcgca ggatagtaga tccacgagag cgtgtgaaag 3 00 

<210> 33 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 33 

gtctgattga agctgttcag gtttatcatg caaatcctcg cctctggcta cggctggctg 60 

aatgctgcat tgctgccaat aaggggactt ctgaacaaga aactaaaggc cttcccagca 12 0 

aaaaaggaat tgtacagtct attgttggtc aaggctatca tcgtaaaata gttttggcat 18 0 

cacagtctat acagaatact gtttataatg atgggcagtc ttcggccatt cctgtagcca 24 0 

gtatggagtt tgcagccata tgtctcagaa atgccttgtt gctgctacct gaagaacagc : 300 

<210> 34 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 34 

tgacagagct gttcagcgta caccagatcg atgagctggc caagtgcaca tcagacactg 60 
tgttcctgga gaagaccagt aagatctcgg accttatcag cagcatcacg caggactacc 12 0 

acctggatga gcaggatgct gagggccgcc tggtacgcgg catcattcgc attattaccc 180 
gaaagagccg tgctcgccca cagacctcgg agggtcgttc aactcgggct gctgccccaa 24 0 

ccgctgctgc ccctgacagt ggccatgaga ccatggtggg ctcaggtctc agccaggatg 300 



<210> 35 
<211> 300 
<212> DNA 



<213> Homo sapiens 
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<400> 35 

cttttttaag caaagcagtt tctagttaat gtagcatctt ggactttggg gcgtcattct 60 

taagcttgtt gtgcccggta accatggtcc tcttgctctg attaaccctt ccttcaatgg 120 

gcttcttcac ccagacacca aggtatgaga tggccctgcc aagtgtcggc ctctcctgtt 180 

aaacaaaaac attctaaagc cattgttctt gcttcatgga caagaggcag ccggagagag 240 

tgccagggtg ccctggtctg agctggcatc cccatgtctt ctgtgtccga gggcagcatg 300 

<210> 36 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 36 

gctggccaaa gccaaatctc ctaagtccac cgcccaggag ggaaccctga agcctgaagg 60 
agttacggag gccaaacatc cagctgcagt tcgcctccaa gaaggggtcc atggccctag 120 
tcgagtccat gtgggctctg gggaccatga ctattgtgtc cggagcagga cccccccaaa 



180 



aaagatgcct gccctagtca ttccagaggt gggctcccga tggaatgtca agcgccatca 240 
ggacatcacc atcaaacctg tcttgtcctt gggcccagct gcccttcgcc ccatgcatag 300 

<210> 37 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 37 

gtccaaggac aacttcgaga catttctttt tgccaccgta tctaacaggg agcaggaaga 60 
tctctgccga ggaattgtcc agctctgctt caatgagcaa agccaacagc tgctagcaga 120 
ggtccagccc tctgactctt tcctcatggt agagacaact gcatactttg aggcctacag 180 
gcacgtcctg gaaggactcc aggaggtcca ggaggaagat gttcccttcc agaggaatat 24 0 

cgtggagtgt aactctcatg tgaaggagcc aaggtacttg ctaatggggg gcagatatga 300 

<210> 38 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 38 

catccaggga gaacctcggg gctgggacac ctcctggccc tcaccctggg tcatgtttac 6 0 

agtcctcagt gccccacacc ggtggccccc tgaggacacc tccaccctga ccttgatttt 12 0 

cccaaacgct gcctcttggt gacagactca gcccaaaacc ccttccttct gtctctggag 18 0 

acccttgagc ttggggaaat atggaggggt gtgtgtctgc aatcaaggcc tctgcagctc 24 0 

acggctggcc cggtgggctg ggacttccgt atgaattnta aatacttagg gttcattttt 300 

<210> 39 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 39 

gggaaggagc gggcgtgagg ccagctgagg catggtgacc cctgggaagg agcgggcgtg 6 0 

aggccagctg aggcatggcg acccctggga aggagcgggc gtgaggccag ctgaggcatg 12 0 
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gtgacccctg ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggtacgg 
gggacttggg ggccgcacct tggtttgccc agggcccctc ctgcaccacg ggccacatgc 
ggaggacggc gtgggatagg ctccctgggt ccacagcttc tgcccgtgta tggggaaccc 



180 
240 
300 



<210> 40 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 40 

ccaaaagctt gtggcaaatt tgaaatttct gccattaggg accttacaac tggctatgat 6 0 

gatagccaac ctgataaaaa agctgttctt cccactagta aaagcagcca aatgatcacc 120 

ttcacctttg ctaatggagg cgtggccacc atgcgcacca gtgggacaga gcccaaaatc 180 

aagtactatg cagagctgtg tgccccacct gggaacagtg atcctgagca gctgaagaag 24 0 

gaactgaatg aactggtcag tgctattgaa gaacattttt tccagccaca gaagtacaat 3 00 

<210> 41 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 41 

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 
gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 
ttcgaatctg ggattccgct gtgcagccga ccgcctgccc actatggact gacaaccaag 180 
gaaagtcttc cccagtccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 240 
ctttgaacga tcccatgcaa agaattccca ccctgaggtg ggttacatac ctgcccaatg 300 

<210> 42 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ttctaagtca ggagtacagt acaaaggaca tgtggagatc cccaatttgt ctgatgaaaa 6 0 

cagcgtggat gaagtggaga ttagtgtgag ccttgccaaa gatgagcctg acacaaatct 12 0 

cgtggcctta atgaaggaag aaggggtgaa acttctaaga gaagcaatgg gaatttacat 18 0 

cagcaccctc aaaacagagt tcacccaggg catgatctta cctacaatga atggagagtc 24 0 

agtagaccca gtggggcagc cagcactgaa aactgaggag cgcaaggcta agcctgctcc 3 00 

<210> 43 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 43 

gccaccgaag cttcaggatg acatcttaga ctctcttggt caggggatca atgagttaaa 6 0 

gactgcagaa caaatcaacg agcatgtttc aggccccttt gtgcagttct ttgtcaagat 12 0 

tgtgggccat tatgcttcct atatcaagcg ggaagcaaat gggcaaggcc acttccaaga 18 0 

aagatccttc tgtaaggcr.c tgacctccaa gaccaaccgc cgatttgtga agaagtttgt 24 0 

gaagacacag ctcttctcac ttttcatcca ggaagccgag aagagcaaga atcctcctgc 300 
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<210> 44 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 44 

ggcttataca acatagtggg gaacgcatgg gaatggactt cagactggtg gactgttcat 6 0 

cattctgttg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 12 0 

aaaggtggat cctacatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 180 

agccagaaca cacctgatag ctctgcttcg aatctgggat tccgctgtgc agccgaccgg 24 0 

ctgcccacta tggactgaca accaaggaaa gtcttcccca gtccaaggag cagccgtgtc 3 00 

<210> 45 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 45 

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt taacagacac 

atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 12 0 

tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180 

cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240 

ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag OAn 



60 



300 



60 



<210> 46 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 46 

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt tangngacac 

atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120. 

tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180 

cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240 

ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 3 00 

<210> 47 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 47 

acacagataa ttttaataca atgtgaaaaa gtgtatgggt gtgtagaaga ggggttctta 6 0 

gagtttctgg agagaatgat tctgagctcg gttttgacaa aagaggagct gctgaggcta 120 

aaagtggatg aaaagggcct tataattaaa agaaacaaga caggactcag aggtgtgaaa 18 0 

caaatattat gcatggtgaa ttacaatgag ttgggggtat tctgtagccc taaagtacaa 24 0 

ggtataaaga gacagaaaat gatcctggaa tatagacaga ggatacttca tctctcatga 300 

<210> 48 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 48 

gatggaacat gagtggaagt gggcagtctt tttctttccc tatcagctga gtgaatgaag 6 0 

atttagaggg cagcagagtc atgacatgga tgacgttggg tctctggatg gctaaatgga 12 0 

agacccgccc cccaacgcca ctctaccccc ctgctttgaa ctatgctttg agaaatgagc 180 

ttatgagacc actgagactt gggggctgtt tgttcagcag ttcacctaca cttattagga 24 0 

aaggttgact tcttgtaact acgcctttcc ttaaatcatc ttttgtataa ttctcagaag 300 



<210> 49 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 49 

ccctccccgg cttcccccgg agtgggtcac cacactgttt tttatcatca tgggaatcat 6 0 

ttcattgact gtcacatgtg gtttgctggt ggcttcccac tggcgaagag aagctacaaa 120 
atatgctcga tggatagcat tcactggaac cactatgaga agattatagg aaaaacacca 180 
agactagagg actctgggtt ccttttatgc aaagtcaact cttctgggtc acagttaccc 24 0 

agcaacaaaa ataaagagag gaccaggacg atgccagcac cccgtttatc ctgagtgaac 3 00 

<210> 50 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 50 

ctcctgtctc agcctcctgg gtagctggga ctacaggtgc atgccaccat gcctggctaa 6 0 

cttttgtatt tttagtacag acagggtttc accacattgg tcaggctggt ctcgaactcc 120 
taacctcagg tgatccacct gccttggcct cccaaagtgc tgagattaca ggcgtgagcc 180 
accgcgcctg gcctgattgg ttttttaaca tgatttttct ctaagcttaa ataccacaag 240 
gccaaagaga aatggtcata atttaaacca ttattatatt ggtgaggtat ccctagctat 3 00 

<210> 51 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 51 

ggaggctaga ctcaagctgt ctggagagtg tgaaacaaaa gtgtgtgaag agttgtaact 60 

gtgtgactga gcttgatggc caagttgaaa atcttcattt ggatctgtgc tgccttgctg 120 

gtaaccagga agaccttagt aaggactctc taggtcctac caaatcaagc aaaattgaag 180 

gagctggtac cagtatctca gagcctccgt ctcctatcag tccgtatgct tcagaaagct 240 

gtggaacgct acctcttcct ttgagacctt gtggagaagg gtctgaaatg gtaggcaaag 3 00 

<210> 52 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 52 

atatggtata gttggaaata ggttattgtg agttatttgt agtcatgtct ttaatggccc 6 0 

ttgcatggtg tctaacttct gcaataaatg atctgccagt cctagtgtct ggctttatgc 120 

aatttgtttt cctttgtgga tgaagtggga gtaagacttg ttgctgtgag gattagatga 18 0 

agtggctagg atatggacac actttacttg aattggaaaa caagccatgt atccctaatc 240 

tgcaaaatgt ggcatgtcac acgtgtaatc tctgaggttt agtttttgct caagattgca 300 

<210> 53 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 53 

aagaagctct gcttggtact actattatga acaacattgt tatttggaat ttaaaaactg 6 0 

gtcaactcct gaaaaagatg cacattgatg attcttacca agcttcagtc tgtcacaaag 120 

cctattctga aatggggctt ctctttattg tcctgagtca tccctgtgcc aaagagagtg 180 

agtcgttgcg aagccctgtg tttcagctca ttgtgattaa ccctaagacg actctcagcg 240 

tgggtgtgat gctgtactgt cttcctccag ggcaggctgg caggttcctg gaaggtgacg 300 

<210> 54 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccaagatgcc aatttccatg aagtcttgat ttatatatat gtacacatgt tatgcacata 60 

catgtttgtt ttctaacagt tattttttaa gcttttgaga taattttaga cttacagaag 120 

agttgtaaaa gtagtagagt tcttgtatac tctgcaccca ccttgccctt atgttaacat 180 

cttacgtaac aatagaacat ttgtcaaaat taagaaatta accttgatat aatactaact 24 0 

aaagtagaaa gtttaaaaag tagagatttt agtcttttca ctaatgtcct tttactgttc 300 

<210> 55 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gggagggacc cttgggggca ggttgtgggt agccagttgc agtctgtggc ctccctcaga 60 

ggtttggagt cgggcgtggc atgctgctgt tggcctcttt ccgagggagt gccatccact 12 0 

ccctgtccca ccgctgtccg cggtgaggac agtgagggca gtgctacgtg gtggggaggt 180 

gtgtgagaag ccacggaagg gcttcacagg gcagatgcca aggccagtgg gccccggaca 240 

gagtcaggct ccctgggcgg ccttgtgtct tggtggccct gatcatcctg ccaatgcaaa 300 

<210> 56 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 56 

ctttgctctc tccattccaa gttgttctct gttctagaaa gcagatgtag tagacatcta 60 
ctgtttttgc ctaaacagaa tccctttttc ctttttttgt taaaagtact catccctaat 120 
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attacattgt tctggaagga ctgaaaataa 
aatcagatta tttcattcct cagcaaacgg 
cttctttggt gttggcatat ggaccctgag 



cagaactcag caccatgatc ggaccgggac 
agatcgatcc gaaaagtgga aatatgagct 
agaaagaact ttaatttttt ctcttggact 



180 
240 
300 



<210> 57 
<211> 276 
<212> DNA 

<213> Homo sapiens 
<400> 57 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 60 

ccacccccca cgcccgaaca gcaggggcag agccagnnnn nnntcgaagt gtgtccnngt 120 

tgtcttttga nccttgttnt ggngccttgc ctanatgtat ntnntntnnn tntnntnatt 18 0 

tnnnnnntnn ntnnnttnct nttnntaaat tgnttnnaan ttntnntann ttnttnnatt 24 0 

nnnannnnnn ntantgtnnt gnattgntat nnatca 2 76 

<210> 58 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 58 

ctgtaagtct ctttcttgcc catcaccaca tccctagtac tgggtatcag tctggccact 60 
tggctttctg gtttgcccca atgtggtcta ttcttgatgc agctaccaaa gtaatgtttt 120 
aaaaccatta taccaagtta ctatccttgt caaaaccccc agtaactgcc aatctcactt 180 
agaataaaat ccggactcct gtgaagcaca gcataaactg gccactgcct atgcagcaac 240 
ctcatcttta ccgtttcctg ccttgctcac tcccttccag cgccgttatt cttcctgatg 300 

<210> 59 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 59 

gacccaggta gaccagctca agagttcatg ttctttgtca tcctngtgtg agctctctgt 6 0 

aagtctcttt cttgcccatc accacatccc tagtactggg tatcagtctg gccacttggc 120 
tttctggttt gccccaatgt ggtctattct tgatgcagct accaaagtaa tgttttaaaa 180 
ccattatacc aagttactat ccttgtcaaa acccccagta actgccaatc tcacttagaa 240 
taaaatccgg actcctgtga agcacagcat aaactggcca ctgcctatgc agcaacctca 300 

<210> 60 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagggtaagg 60 

gtcttattaa aagccaccac tttgctgagg cctgtacagg ccttgggggt ttggggaaga 120 

gaaataaggc aggcacttgt cccttcaggg agggacttgt ccctcactgg gaggtttggg 180 

gttgaccttg gctccagcag agatacccag cctggcgtgg aaggggcagg tctgagctta 24 0 

cgcttgactg cagggcaagc tgcaggcctc ttctgccttc ccctgcattc accaaggaca 300 
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<210> 63 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 63 



<210> 64 

<211> 294 

<212> DNA 

<213> Homo sapiens 

<400> 64 



<210> 65 
<211> 300 



60 
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<210> 61 
<211> 292 
<212> DNA 

<213> Homo sapiens 
<400> 61 

caaggcccga ggtgccatcc cctctgggaa gcagaagcct ggnggcaccc agagtgggta 
ctgtngnggt aaagngntca ccctctcaca gcaccaccag cggcgagaca gaccccacca 12 0 

ccatcttccc ctgcaaggag tgnggcaaag tcttcttcaa gatcaaaagc cgaaatgcac 18 0 

acatgaaaac tcacaggcag caggaggaac aacagaggcn aaaggctcag aaggcggctt 240 
tngcagctga gatggcagcc acgattgaga ggactacggg gcccgtgggg gc 292 



<210> 62 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 62 



cagcaagcct 


tggatcccaa 


60 


agggcagcaa 


tggaacgtca 


120 


caaaatgtat 


ttaacagaga 


180 


cagatgaaag 


ggtctcaaac 


240 


cataataatg 


gtgaaataaa 


300 



tggaaccctg 


ctactgcccc 


60 


gctgcccttc 


acctttgggg 


120 


ggaaaaaggt 


ggacagccgt 


180 


gagaggcaca 


gctccttcct 


240 


gtctctgctc 


cactccccgg 


300 



ctgcgatggg 


gacctcttct 


60 


gcgagagctc 


aagggctgcc 


120 


catgatgcct 


ttgagcttaa 


180 


caagagcact 


gaagacaccc 


240 


atttctgggc 


cccg 


294 



16 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 





PCT/US98/27610 



<212> DNA 

<213> Homo sapiens 



<400> 65 



aattgatgag ccttattaac tatcttttca ttatgagaca aaggttctga ttatgcctac 

tggttgaaat tttttaatct agtcaagaag gaaaatttga tgaggaagga aggaatggat 

atcttcagaa gggcttcgcc taagctggaa catggataga ttccattcta acataaagat 

ctttaagttc aaatatagat gagttgactg gtagatttgg tggtagttgc tttctcggga 

tataagaagc aaaatcaact gctacaagta aagaggggat ggggaaggtg ttgcacattt 



60 
120 
180 
240 
300 



<210> 66 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 66 

agcagatttg tgataaactt gctgtagaag aaaccaaagg ggaacttctg ttgcaactat 60 
gtcgtttgga agatgctgca gatgtttata gaggattgca agagagaaat cctgaaaact 120 
gggcctatta caaaggcttg gaaaaagcac tcaagccagc taatatgtta gaacggctaa 180 
aaatttatga ggaagcctgg actaaatatc ccaggggact ggtgccaaga aggctgccgt 240 
taaacttttt atctggtgag aagtttaaag aatgtttgga taagttccta aggatgaatt 300 

<210> 67 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 67 

tgttcttgta gtgtttgttg ctattgttag aaagattatt agtgatatgt ggggtgtctt 60 

anctaaacaa cagacacatg taagaaaaca ccagtttgat catggagagc tggtttacca 120 

tgcattgcaa ttgttagcat atacagccct tggtatttta attatgagac taaaactctt 180 

cttgacacca cacatgtgtg ttatggcatc actgatctgc tcaagacagc tatttggatg 240 

gctcttttgc aaagtacatc ctggtgctat tgagtttgct atattagcag caatgtcaat 300 

<210> 68 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 68 

agacaaagaa aaggtggcaa tcatagaaga gttagtagta ggttatgaaa cctctctaaa 6 0 

aagctgccgg ttatttaacc ccaatgatga tggaaaggag gaaccaccaa ccacattact 120 

ttgggtccag tactacttgg cacaacatta tgacaaaatt ggtcagccat ctattgcttt 180 

ggagtacata aatactgcta ttgaaagtac acctacatta atagaactct ttctcgtgaa 240 

agctaaaatc tataagcatg ctggaaatat taaagaagct gcaaggtgga tggatgaggc 3 00 

<210> 69 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 69 

aattcnacac gaggtggccc ataagtttta ccttttaaac atccggctgc ctgtgaatga 6 0 

gaagaataaa atcaatgtgg gaattgggga gataaaggat atccggttgg tggggatcca 12 0 

ccacaatgga ggcttcacca aggcgtggtt tgccatgaag acctttctta cgcccagcat 180 

cttcatcatt atggtgtggt attggaggag gatcaccatg atgtcccgac ccccagtgct 24 0 

tctggaaaaa gtcatctttg cccttgggat ttccatgacc tttatcaata tcccagtagg 300 

<210> 70 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 70 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 6 0 

ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 120 

atgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 18 0 

ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 240 

cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 300 

<210> 71 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 71 

ggaaatgcaa gtcaaaacag ctttgtaggt ctcagagttt gcttttaaga agtagtacaa 6 0 

gaaggaatag ttatatcaat acaccagtgg ctgaaattat catgaaacca aatgttggac 12 0 

aaggcagcac aagtgtgcaa acagctatgg aaagtgaact cggagagtct agtgccacaa 18 0 

tcaataaaag actctgcaaa agtacaatag aactttcaga aaattcttta cttccagctt 24 0 

cttctatgtt gactggcaca caaagcttgc tgcaacctca tttagagagg gttgccatcg 300 

<210> 72 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 72 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 6 0 

ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 12 0 

aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 18 0 

cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 24 0 

ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

<210> 73 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 73 

ataacacaca tcacagtatg ctctcagaaa tttctttatt tgaaccctat accaatatct 60 
gttgatcaat gaccattttt gctcagcatg gagaaacagt gccctgcatg aagggtagtg 12 0 
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agaataaaaa ggatcttacc acctttatca tgagggtggc tttgctctct ccattccaag 
ttgttctctg ttctagaaag cagatgtagt agacatctac tgtttttgcc taaacagaat 
ccctttttcc tttttttgtt aaaagtactc atccctaata ttacattgtt ctggaaggac 



180 
240 
300 



<210> 74 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 74 

cagagtcaac atggagcatc tcactgtgaa atgatccatg gattgaagga tatggtaaaa 6 0 

tgtttatagg ttactttgaa agtaaaatat actatgtctt ggttttgagg atattggata 120 
caaaactctc ttcctttagg gctactgaga cttgattcct gatcatcaga aatttcacca 180 
gaaacaactt gcttccaata tacccaattc tatatgaaga attcatggag agtgtactgg 24 0 

cactgnnnnn nnnnnnngan ncntgctgct ncgaanntnt nntattnact gannntgaat 300 



<210> 75 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 75 

caagagagag tgatagaatt ggcagtgaaa tatacgaacc accctcctgc cctctgggtt 6 0 

cacaatacgt gtacacttga ctgtgaagtg gctgtgagag tgggtggaga gttcttcttt 12 0 

gaccctcagc ctgcggatgc ctctagaaac ctcgtgttga ttgcaggagg agtcggaatt 18 0 

aaccctctgc tttccatcct gcggcacgca gcagatctcc tcagagagca ggcaaacaaa 240 

agaaatggat atgagatagg aacaataaaa ctattctaca gtgcaaaaaa taccagcgaa 3 00 

<210> 76 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 76 

gctagacgaa gtggtgaagc ccaaggactt atttttgagc tcgctgtaag actgagaaat 6 0 

cacgtactcc ttcctgaaac cactaagagg aaaaatgtct gtgacactgc atacagatgt 120 
aggtgatatt aaaattgaag tcttctgtga gaggacaccc agaacatgtg agatggagtc 18 0 

tcgctgtgtc ccccaggctg gagtacaatg gcgcgatctc ggctcactgc aacctccgcc 240 
tactgggttc aagcaagtct tctgcctcag cctcccgaga actgcaagag gaggcaactg 300 

<210> 77 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 77 

agagactttt gtttgtgttt aattagggct atgagagatt tcaggtgaga agttaaacct 60 
gagacagaga gcaagtaagc tgtccctttt aactgttttt ctttggtctt tagtcaccca 120 
gttgcacact ggcattttct tgctgcaagc ttttttaaat ttctgaactc aaggcagtgg 180 
cagaagatgt cagtcacctc tgataactgg aaaaatgggt ctcttgggcc ctggcactgg 240 
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ttctccatgg cctcagccac agggtcccct tggaccccct ctcttccctc cagatcccag 300 

<210> 78 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 78 

caggagcaat caattcctgt cgaagtgaat accatgcagc ttttaacagt atgatgatgg 60 

aacgcatgac cacagatatc aatgcactga agcggcagta ctctcgaatt aaaaagaagc 12 0 

aacagcagca ggttcatcag gtgtacatca gggcagacaa agggccagtg accagcattc 18 0 

tcccgtctca ggtaaacagt tctccagtta taaaccacct tcttttagga aagaagatga 24 0 

aaatgactaa cagagctgcc aagaatgctg tcatccacat ccctggtcac acaggaggga 300 

<210> 79 
<211> 278 
<212> DNA 

<213> Homo sapiens 
<400> 79 

gtgctgcaga ggaagacagc ctgtcaggat actgacgagg aggaggaaga ggaagatgat 6 0 

gatcaggctg aatacgacgc catgttgctg gagcacgctg gagaggccat ccctgccctg 12 0 

gcagccgcgg ctgggggaga ctcctttgcc ccattctttg ccggtttcct gccattattg 180 

gtgtgcaaga caaaacaggg ctgcacagtg gcagagaagt cctttgcagt ggggaccttg 24 0 

gcagagacta ttcagggcct gggtgctgct cagcccag 278 

<210> 80 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 80 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 6 0 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacctgt ttctgggagg 12 0 

tgagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 18 0 

tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 24 0 

agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 300 

<210> 81 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 81 

acctgtaccg cctggccact ggctgtcacc ggcgtgatga gctgccggtg ttngaacgca 6 0 

acctatgctg gactctcccg gcagactgcc tggatatggt cgccatgcag gaagccgccc 12 0 

agcacctcct cggcacacac gacttcagcg ccttccagtc cgctggcagc ccggtgccga 18 0 

gccccgtgcg aacgctgcgc cgggtctccg tttccccagg ccaagccagc cccttggtca 24 0 

cccccgagga gagcaggaag ctgcggttct ggaacctgga gtttgagagc cagtctttcc 300 

<210> 82 
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<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 82 

cccagctgga cctggtggcc ctttcctagt gcctctgctg ggggaggaga acctgggtcc 6 0 

acgtggaggc taggaggtct caggtgctgc cctggcagca ccagagtgtg ggccgggccc 12 0 

gagtgtctgc ccctcggccc tcagggtggg gcacttagca cccagaaggg accaaaagca 180 

gggcatggcg gtgcagagga gtttgggagg tgtaaacagc cccatgcacg tggaggagga 24 0 

gctggctttc agccccagac cccacgctag cactttccac gctgcttgcc cgctgttgat 300 



<210> 83 
<211> 272 
<212> DNA 

<213> Homo sapiens 
<400> 83 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt 
agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata 
gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat 
aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt 
gaaatggttg ttcatgtcat ttgctcattt tt 

<210> 84 
<211> 300 
<212> DNA 

<213 > Homo sapiens 



<400> 84 

cccactgccc ccggtcaaca aacccacttt tatgacagtt ttcttccgca gcttggctct 6 0 

taaattttac tggcaggtgt atggttgttg gagggttcct agtgagttgg gggacctggc 12 0 

aatagagctg cttggttgga ggaagtgaag ctggcttagt accagcagct gatctcttcc 180 

acgtgctgct gctttttttg ccactctgat actaaaccag agaaagctgc aggtggataa 24 0 

agaagctgtg gctgtttttt gcttttgggt ggcaatgaga aagagtcaca gtgtgggtta 300 



<210> 85 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 85 

ctgggagcca ccaacatagc agattaccat gtgaagttgc cactgctgca tctcctgaaa 6 0 

cctggctgat gggagaggtc tcattttgtg tctgagaatg tccaggttgt ctgcagacca 12 0 

cagcactgat ttcccattag cagttattat ttcctggcca tttcttcctg aaggttttgt 180 

ggttaaactc cctgtcctca atattttatc agcagtaggg ctgtcattct tctggttatc 240 

aacctctaca ttatgaagta aggttcaacc cttctgcttt tctcaggccc ccaaaacggt 300 



<210> 86 
<211> 300 
<212> DNA 

<213> Homo sapiens 
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cacatgtgag 6 0 

attttggcca 120 
tactcattga 180 
cctcctttat 240 
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<400> 86 



gtgcaggacc 


cggaggtgct 


ttccttgcca 


60 


atcccccaat 


cccagacagc 


aattatgtca 


120 


tttggcccat 


ctttttattt 


ttaaaaaatt 


180 


gggtggcgtg 


ggatgggtga 


ccactaagtt 


240 


ggccctgata 


gaactttcca 


ggagttcatg 


300 



<210> 87 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 87 

ctccaaggaa aatccacctc gcagcttgta aatctacagc ctgattacat caaccccaga 

gccgtgcagc tgggctccct tctcgtccgc ggcctcacca ctctggtttt agtcaacagc 12 0 

gcatgtggct tcccctggaa gacgagtgat ttcatgccct ggaatgtatt tgacgggaag 180 

ctttttcatc agaagtactt gcaatctgaa aagggttatg ctgtggaggt tcttttagaa 240 
caaaatagat ctcggctcac caaattccac aacctgaagg cagtcgtctg caaggcctgc 



<210> 88 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 



300 



60 



<400> 88 

ctgaaacaaa agatgtattt caattaaaag acttggagaa gattgctccc aaagagaaag 

gcattactgc tatgtcagta aaagaagtcc ttcaaagctt agttgatgat ggtatggttg 120 

actgtgagag gatcggaact tctaattatt attgggcttt tccaagtaaa gctcttcatg 180 

caaggaaaca taagttggag gttctggaat ctcagttgtc tgagggaagt caaaagcatg 24 0 

caagcctaca gaaaagcatt gagaaagcta aaattggccg atgtgaaacg gaagagcgaa 3 00 

<210> 89 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 89 

ggggacatgt gtccctcagc tcagcagagg ctgtggtaca acatggtcct tggtgaagac 60 

ctgcacccct ggaacctccc accatcgtca caactgtagt ctcatttgca gtggagaaaa 120 

gaacccgatg tcccacagcc agatatacac ccagctccat gccagccctt catgtttacc 180 

ttttgctttg ttaattacat gtcagactcc tagagggcct ccagactaat aggaagcatt 240 
tctgtaacca acctgccacc cactgattca gaaatggaaa tcacattcca caatctatgg 

<210> 90 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 90 

ctcatacaga aagtcagatc aacaaagagt ccaagaaaaa tgcgacccag ctagaccatt 

22 



300 



60 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 





PCT/US98/27610 



tgatcccagg cttagcacac gattgcatgg catccccttt agccacttca accactgcag 

acatccagga agctggactc tctcctcagt ccctccagac ttctggccac cacagaatga 

aaaccccatt ttcaactgag ctatctttgc tccagcctga tactccagac tgtgctggag 

atagtcatac cccactggct ttttccttca ccgaggactt ggaaagttct tgtttgctag 



240 



300 



120 
180 



<210> 91 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 91 

aatgcaaagg gctgcagttc tcattcaggc tactttcagg atgcacagaa catatattac 6 0 

atttcagact tggaaacatg cttcaattct aattcagcaa cattatcgaa catatagagc 12 0 

tgcaaaattg caaagagaaa attatatcag acaatggcat tctgctgtgg ttattcaggc 18 0 

tgcatataaa ggaatgaaag caagacaact tttaagggaa aaacacaaag cttctattgt 24 0 

aatacaaggc acctacagaa tgtataggca gtattgtttc taccaaaagc ttcagtgggc 300 

<210> 92 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 92 

aagatatgca gagatattcc aggatctttt agctttggtg cggtctcctg gagacagtgt 6 0 

tattcgccaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 120 
cattgcactt atcttaccaa gctcttctga aggttctatt tctgaactgg agcagctctc 180 
caattctcta ccaaataaag aattgatgac ctcaatctgt gactgtctgt tggctacgct 24 0 

agctaactct gagagcagtt acaactgttt actgacatgt gtcagaacaa tgatgtttct 3 00 

<210> 93 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 93 

cgattcgcca gttctccatt ctgagagtca atcacgttcc tgataggttg tcattgattt 6 0 

ttttcttcgt tggttttaac cttctaaaca tctccaggcc actttcttag cctttttcta 120 

ggtactaaaa agaggtccta cccacacctg cctcacactt ctcctttcca aggctgcctg 180 

agtttggagg ggcttgggtg tgtgtgaaca agggccctgc attgtctagg cctgcagttc 240 

ccaggcttgg gttcactttc accatgcatt ggcaaaacta gaaaagtaag cttgtgacaa 3 00 

<210> 94 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 94 

tttgtgcctg agcacccaca atttcaggat ttagactgtg tggcacctca gctttcctct 60 

ggatgtaacc actccttggt gagagaggga actcctcacc aatcccattt gacaaaggct 12 0 

aggcaatctt cattctgcrt ggctttagtc attcttgtca ttgggctgca gaagaaaaac 180 

aactttgctg ggtgatccca ctgccttgat ttcacctcgg agcgaggctg ggccatgtcc 24 0 
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aagtcttatg aggtcaccct gactagaaaa aattgaactc acctacaaat agtctgaaag 3 00 

<210> 95 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 95 

gtgagtccga gcatcagtgg cttctggagc agaccagcca cgtggaagag aagccttaca 60 
gagatgggtc ggcagagccc tgctgatggc tgggccttgt gggcagccac tctgtgtgag 12 0 

cagggtgttg ggcccataca cttcaaagac cagagccctg cactgggaga gtgctcctgg 18 0 

cccaggctgg gaatcacctt tcgaggccct tcagactctg gcggggcttg ctgtggcctc 24 0 

cctccagcta gtggtgtggc tgagcagact ccagggccag ggccagttcc cttctcccct 300 

<210> 96 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 96 

acaactccag acataattaa agactggccc aggaggaaga gggcggtggg ctgtggcgcc 6 0 

ggctcctctt ccgggagggg cgaggtcggt gcagaccttc ctgggagcct gtcactgctt 12 0 

gagacagagg gcaaggacca cggccttgaa ctcagcatcc acaggacgcc catcttggag 180 
gattttgagc tcgagggagt gtgccagctc ccagaccagt cgcctcccag gaacagcatg 240 
cctaaggccg aggaagcctc ttcctgggga cagtttgggt tgagttccag gaagagagtc 300 

<210> 97 
<211> 286 
<212> DNA 

<213> Homo sapiens 
<400> 97 

gtccagggcc cangttttaa tttnttttta aaaagcttta ggtcttgccg ggacggtggt 6 0 

tcacncnnnn nnnnnnnnnn nnnnnnnagg cctaggcggg tggatcacaa ggtcagcagt 12 0 

tcaagaccag cctgaccagc atggtgagac cctgtctcta ctggaaatac aaaaaaattg 180 
gctgggcgag gtggcaggca cctgtggtcc cagctacctg ggaggctgag gcgggagagt 24 0 

ctcttgaaac tggaaggcag aggttgcggt gagccgagat tgcgcc 28 6 

<210> 98 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 98 

caccattttt attttgatgc ttacactcat ttattctgtt tttgtaaaac agtttcggga 60 

atttaaaaat ccttccagtt aatagagctt ttgttattat attataattt tgtaaaccca 120 

ctttgttttt cccactttaa agccacaggg tcgactcatg gatgatacct ctattgctgc 180 

tgcatgatgt tcaagaccgg cccttggctg ttgttacaga gatgttgggc agagctatgc 24 0 

aggtgtttca ttgtgaactc tagctttgat catggtaaaa agttaaccct ttctattttt 300 

<210> 99 
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<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 99 



agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 
ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cctccaagtc 
tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg ggcttctatc 
tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 
cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctggcctcca 



240 
300 



180 



120 



60 



<210> 100 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 100 

tccaaccctg gcgatgtcac cagcatggtg gctcaggtta gagctctctg aggacccagc 60 
atagagcact ggtgccaggg accaaactga gaccccacca ccgtcatcaa cacttacata 12 0 

ccataaaggt cttcagagtg ccttggccct agacctccct tcattctttg tagagatgga 180 
atctaagaat gaaacatctc cactcagtcc tgcaaatatg gaagttcttg agataccttt 240 
ttttggtaga tacttgtgct ggtattctga gagtcacttt actctgatgg tttgcaagat 3 00 

<210> 101 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 101 

gtgtttcttc tacctcccct gcacaacatt gtttatatgc cccctaaaat gtaacttctt 60 

tagattctgt tgttacgtgc aacactgtat atctctccat agcacttaat cagagtttgt 120 

aattaggcat ctttttgtgt gattatttgg taaatgtcca tatcccctac tagcctataa 180 

gctccatgac ttctaggtac cctgtctgac tacgtgtatc actggttcta ccgcctaaca 240 

ttgcctagca cattcattgc ttcacaggca tctgaatatg ggtttataaa atacattgct 300 

<210> 102 
<211> 270 
<212> DNA 

<213> Homo sapiens 
<400> 102 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 60 
tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 12 0 

tggaggaggt gtnnnnnnnn nnnttnntgn ncccactacc ntgcactgaa ctggccntgt 180 
tacancaann actgnncccn nttgttatna cacctntnac aaacacctgc tgctgtacat 24 0 

gncnctactt taaggactnn anacctgtgc 2 70 

<210> 103 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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<400> 103 

gctggagcac gctggagagg ccatccctgc cctggcagcc gcggctgggg gagactcctt 6 0 

tgccccattc tttgccggtt tcctgccatt attggtgtgc aagacaaaac agggctgcac 12 0 

agtggcagag aagtcctttg cagtggggac cttggcagag actattcagg gcctgggtgc 18 0 

tgcctcagcc cagtttgtgt ctcggctgct ccctgtgctg ttgagcaccg cccaagaggc 240 

agaccccgag gtgcgaagca atgccatctt cgggatgggc gtgctggcag agcatggggg 3 00 

<210> 104 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 



<400> 104 

ctcgcgtctc ttcactgcac attgcaatgc atttgcgatt cccatttctc tgctaggagc 

cagcctgggt ggcgctgctc ccagagccgt gggtcccaga ccttgcgttc cttttgttcc 12 0 

tgtccgttta tcaggacacg ggccccacct gtcacgtgcc cgaggccacc caagcccagc 180 

ctgcggggcg ttcccactgc ctggatgccg gcttgagttc tgcgcacgca ggattcagtg 24 0 

tggggacggc ccctgccgga taggcctagc cctggcccag gtggtgagcg gtttgcagtg 3 00 

<210> 105 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 105 

gggcactgtg gggctctccc cgcctctcct gccttgtttg cccctcagcg tgccaggcag 60 
actgggggca ggacagccgg aagctgagac caaggctcct cacagaaggg cccaggaagt 12 0 

ccccgccctt gggacagcct cctccgtagc ccctgcacgg caccagttcc ccgagggacg 18 0 

cagcaggccg cctcccgcag cggccgtggg tctgcacagc ccagcccagc ccaaggcccc 24 0 

caggagctgg gactctgcta cacccagtga aatgctgtgt cccttctccc ccgtgcccct 3 00 

<210> 106 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 106 

gctcaacgcc tatgtgaccc atctccatgc cgaatacaat cgacagaagg acatctacct 6 0 

agcacatcgt gtggcccaag cttgggaatt ggcccagttc atccaccaca catccaagaa 120 

ggcagacgtg gttctgttgt gtggagacct caacatgcac ccagaagacc tgggctgctg 18 0 

cctgctgaag gagtggacag ggcttcatga tgcctatctt gaaactcggg acttcaaggg 240 

ctctgaggaa ggcaacacaa tggtacccaa gaactgctac gtcagccagc aggagctgaa 300 

<210> 107 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 107 

tgtgagtttc ctatctgttc cagactagta tcgccaatct ctcccagctc tcttctttcc 
tccctggcct ttgtcctgca ggaggtagca tcacctcttg gcattttgta catgctttta 12 0 



60 
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aacaattgga ggagctgccc aggcagtttt atggcctcct ggttgtgtgc cttcacaccc 
gcctacagcc ccacctcacc atcaagcgct gagccaatgc gggtgtggct ggccctgagt 
tcctgagtca gctccttgcc agggccagag ctggtaacag cggggcagca gggtgggtag 



180 
240 
300 



<210> 108 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 108 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 12 0 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tgcgacctcg 180 

ggggggccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 24 0 

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 3 00 

<210> 109 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 109 

cacaaggaga agaaagttaa ttaacattga aagatgagaa gacatcttgg aagaacttga 6 0 

attgggcctt ggaagaagaa cagccattca aatagataga attgtggtag caaaggcata 12 0 

gaggtaggaa agtatagatc tccagggaca gtagtcatgg ggttggggca ctgttggaat 180 

ttaaggttgg aaggatatat tggagcccct tgaatacggt aacaaggcac accttgggca 24 0 

gtggagagtt atcagagtgt ttgaaaagga gggttattga gtaaataaat agactggtac 300 

<210> 110 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 110 

gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacgtg 6 0 

gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga ggaggagagt 12 0 

gacaaggccg cgctcctgca ggagcagcag cagcagcagc agccgggatt ctggaccttc 180 

agctactatc agagcttctt tgacgtggac acctcacagg tcctggaccg gatcaaaggc 24 0 

tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcggccggat 300 

<210> 111 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 6 0 

tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 12 0 

tggggggggt ggggnnttcc ttctannccn ntacnctatg tgtttaatnn ncntantnct 18 0 

ttantantat ttnccantgn tnntnatatn nttnnanana nncntnctta nnnacattat 24 0 

tttanntang ngatnntacc ttnntgnaan g 271 
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<210> 112 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 112 

gttccctcac cttattcctc caagttcccc cttgggaacc tctgagatta acttgataag 60 

ctccttgggc aagctcttta tcctaagatt cctcagtgag ccttatagag ttgctgcgag 12 0 

aattacattt gttcatgatg tcaagtgtct ggtatgtagc taatgcttat tgaacacata 18 0 

gtaatttatt gaataattgt catgatcact ggatgagata tagccactgt ggaggtaggc 24 0 

acaccagggt tttagaggct tgggatcttg caacaggatt ttcctcttgc ctctccaaac 3 00 

<210> 113 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 113 

cccacatgta ccaggttgag tttgaagatg gatcccagat agcaatgaag agagaggaca 6 0 

tctacacttt agatgaagag ttacccaaga gagtgaaagc tcgattttcc acagcctctg 12 0 

acatgcgatt tgaagacacg ttttatggag cagacattat ccaaggggag agaaagagac 180 

aaagagtgct gagctccagg tttaagaatg aatatgtggc cgaccctgta taccgcactt 24 0 

ttttgaagag ctctttccag aagaagtgcc agaagagaca gtagtctgca tacatcgctg 300 



<210> 114 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 114 

acagttagtg taaaggatct gaatggcata gacttaactc ctgtgcaaga tactcctgtg 60 

gcttcaagaa aagaagatac atatgtacat tttaatgtgg acattgagct ccagaagcat 120 

gttgaaaaat taaccaaagg tgcagctatc ttctttgaat tcaaacacta caagcctaaa 180 

aaaaggttta ccagcaccaa gtgttttgct ttcatggaga tggatgaaat taaacctggg 240 

ccaattgtaa tagaactata cacgaaaccc actgacttta aaagaaagaa attgcaatta 3 00 

<210> 115 
<211> 288 
<212> DNA 
<213> Homo sapiens 

<400> 115 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 120 

gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 

gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 

ctcttgtaag gcaggaggat anaanaggac cannaanggn gggagctg 2 88 

<210> 116 
<211> 300 
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<212> DNA 

<213> Homo sapiens 



<400> 116 



tcaattagta acatctgaaa aaacagcttt gtcctgggtg aaaaaggatg ccaaaattgc 
ctggaaaaga gcagtgagag gagtccggga gatgtgtgat gcatgtgaag caacattgtt 
taacattcac tgggtctgcc aaaaatgtgg atttgtggtc tgcttagatt gttacaaggc 
aaaggaaagg aagagttcta gagataaaga actatatgct tggatgaagt gtgtgaaggg 
acagcctcat gatcacaaac atttaatgcc aacccaaatt atacctggtt ctgttttgac 



60 
120 
180 
240 
300 



<210> 117 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 117 

gcactttcca gaattctctc atatttgtgg gctgggatca agcctgcagc ttgaggaaag 60 
cacaaggaaa ggaaagaaga tctggtggaa agctcaggtg gcagcggact ctgactccac 120 
tgaggaactg cctcagaagc tgcgatcaca actttggctg aagcccctgc ctcactctag 18 0 

ggcacctgac ctggcctctt gcctaaacca caaggctaag ggctatagac aatggtttcc 240 
ttaggaacag taaaccagtt tttctaggga tggcccttgg ctgggggatg acagtgtggg 3 00 

<210> 118 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 118 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 6 0 

gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 
tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 
gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 
cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 300 

<210> 119 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 119 

gaacaaagaa ggaatgtctt cctcatgttt gggtctatag aagacgttaa agaaaacttc 60 
cagaaagtgg gtttgaggca tgagccacca cgcctggcca aaggatttaa tgaattaatg 12 0 

gatgtacagt gctggggctg gtattctagg gcctgcattg agactcacat tttgccatca 180 
aaagcctttt aagaggtgga ggttgcggtg agctgacatg gtgccactgc actccggcct 24 0 

gagtgacaga gtgagactct gtctcacaaa aaaaataatg ccctttaaat aatgaataat 300 

<210> 120 
<211> 273 
<212> DNA 

<213> Homo sapiens 
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<400> 120 

cctcagcctt ctaaaaagct ggggctacac ccagctgaag aaattgtaac taaagataga 6 0 

ttgtttaaag caaagcaaga aacttctgaa gaaatggaac aaagtggaga agcctcagga 12 0 

aagcccaaca gagagtgtgc accccagatt ccttgtagta ctcctattgc tactgaaagg 180 

acagttgcac atttgaacac tctgaaggac cgtcacccag gtgatttgtg ggcccgcatg 24 0 

cacatctcat cccttggaat atgctgcagg aga 2 73 

<210> 121 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 121 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 60 

gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 

tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 

gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 

cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 3 00 

<210> 122 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 122 

gttgcaagca gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtt 6 0 

ccagaatgat acacggatca gtgcagaagt tcatcaggct ctcggacctt agggctgttg 12 0 

gagaaggctt cagcagcaga actgatggtg aaggctcgtg ttctccatcc tcaactttct 180 

ttgcttcgat catacacaag aatacatttg gaagggcaaa aaaatgaaca ctgtcgttca 24 0 

ttgcagccgt gttttgtgac acagatgcac agtctgctgt gaagaccttc tctcaagtgg 3 00 

<210> 123 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 123 

gtgatttcag cttccaaact ggtatacatt ccaaactgat agtacattgc catctccagg 60 

aagacttgac ggctttggga ttttgtttaa acttttataa taaggatcct aagactgttg 120 

cctttaaata gcaaagcagc ctacctggag gctaagtctg ggcagtgggc tggcccctgg 180 

tgtgagcatt agaccagcca cagtgcctga ttggtatagc cttatgtgct ttcctacaaa 240 

atggaattgg aggccgggcg cagtggctca cgcctgtaat cccagcactt tgggaggcca 3 00 

<210> 124 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 124 

catgctggcc agcatccctg cctgtgcaag ctctggatga gctgtgagcc cctgccaccc 6 0 

acacccccac tccctgccag cctggcctca gggcctctga tccatgtgca ctggagagga 120 
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gatgactgac agggccactg gggcatttcc acgttaacag 
agtgactcgc caatggaggc atctcagatg tgggcccagg 
acagggctat ccattcattg tcccaccaaa ggctatggag 



cagctgccac 
agtctgggga 
cccacccacc 



tggcaaaaga 
gctactttga 
atgtgctgga 



180 
240 
300 



<210> 125 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 125 

ggtaaattgg ttgaattatt gtattgaagc ttgagctgta gctaaaagta atttaggttt 6 0 

cccctaagat gttattatgt tagggacata acacttttgg gaggttgttg tgggagatgg 120 
ttgatttagg ttttcaaaag ctagaaataa aatttacatg ccttagattt cataaaattc 180 
tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctcctaagg ttatgtccta 24 0 

ataactattc ttttaggagt atacttctac tttatagaag gttgcttttc tttttaattt 300 

<210> 126 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 126 

tgaagaggag atcggtgacc tgggctcctt atgtgcctga aagagtttga gtttcctgtt 6 0 

aactccaaat caacagtatt ttcaacaaga aatgtgcaat tgaaatcaag tgctgtttaa 120 
gtgcagctag gatttccaca ggaagacact tgcagtgaac agagttatgg agcagcaaaa 180 
acacagatct atttggaaaa agagaaaaca tatgcgttgt attttgcttc aattataaaa 240 
taccatcctc tcaaaggtgg ttctaaatta caaaggactt tgatttctag gtagattctg 300 

<210> 127 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 127 

ggtgattccc atgctgaaca gtttgatctc ctgccagagt gtcgggccac aaactgggca 6 0 

gcacatcagg atcacctggg ggccttcaaa aatcaaaaat ccacccccag gccatgccct 12 0 

ggacccactg caccaggaca agaaatccac cccaggcctc tccccagacc cactgcacca 180 

ggacaagaaa tccaccccca ggccacgccc cagacccact gccctaggat gtgggggtgg 240 

gaaccaggtg gtgctttgta aagacgtgca ggtggtaacc ccaggccccc acgctcggaa 300 

<210> 128 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 128 

tgagctggga gaaggggaga aagtttgtga agaggagatc ggtgacctgg gctccttatg 60 

tgcctgaaag agtttgagtt tcctgttaac tccaaatcaa cagtattttc aacaagaaat 12 0 

gtgcaattga aatcaagtgc tgtttaagtg cagctaggat ttccacagga agacacttgc 18 0 

agtgaacaga gttatggagc agcaaaaaca cagatctatt tggaaaaaga gaaaacatat 24 0 

gcgttgtatt ttgcttcaat tataaaatac catcctctca aaggtggttc taaattacaa 300 
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<210> 129 
<211> 285 
<212> DNA 

<213> Homo sapiens 
<400> 129 

ggaaagcaca aggaaaggaa agaagatctg gtggaaagct caggtggcag cggactctga 6 0 

ctccactgag gaactgcctc agaagctgcg atcacaactt tggctgaagc ccctgcctca 120 
ctctagggca cctgacctgg cctcttgcct aaaccacaag gctaagggct atagacaatg 180 
gtttccttag gaacagtaaa ccagtttttc tagggatggc ccttggctgg gggatnnnnn 240 
nnimnnnnnn nnnnnnnnnn nnaggaagat accatttctt gacgg 285 

<210> 130 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ccggacgcag gccctcgggc aggagcatct ggcagagtgg ggggcgtggc aggcaccctc 6 0 

ctttgcaggg cgaggtgggg cctctgcagc catcctggac aggccggggt ggcggcagct 120 
ttgcccacgt ggaagcgggg tgggtctcac ttgcgtggtg gcccctggcc ccatcttgcc 180 
tgctgcggcc tggggagcag gcgctgggtg gtggttctgc ctgcttgctg ctcgttcccc 240 
gggcatgcgt gggcagcggg gggcatgcgt gggcagcagg gggccgtggg cagcgggggc 3 00 

<210> 131 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 131 

gatctctata ctagtgaaca gtgccagttc cacactttgg acttagaact gttctctagt 60 
tattgtaaca cagaatactg tcaatcccta atttacttaa tgt tact tat tggaagtggg 12 0 

gctgatgaaa tacgcacagg agggaaatct actgtgttta ggcacaggca gccccagtgt 180 
ataaggagat catattccaa aaggttgtca gttggttgtt tgcaacctgg aatgtatttt 24 0 

cctttagaga ccaggttatc catggtggtt aggcccctag agcagctgga aaagatgatc 3 00 

<210> 132 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 132 

ctcccatgga ggtggtggga atggcaccga gaagtttgat gacagttatc taatggacta 6 0 

gaggttggca aactttctgt aaatggccag gtagtaaata gttctgcttt tgaaggcata 12 0 

tggtctcttg cacctactcg aggctgaaag cagctataga caatacataa atgaatgagc 180 

gtgagtgtgt tccaataaga aaaaaacatg gctgtttgct tcggccccag ggttgtagct 24 0 

taccagtcct gtaacagatc acagtttgct cttttggtca caaatacttg aacccctccc 3 00 

<210> 133 
<211> 269 
<212> DNA 
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<213> Homo sapiens 



<400> 133 



atgctatgcc aaagcctgct gccagctcca tagcctggac ctacagcact gcatggtgga 

gtccacagct gtggtgagct tcttggagga ggcagggtcc cgaatgcgca agttgtggct 

gacctacagc tcccagacga cagccatcct gggcgcactg ctgggcagct gctgccccca 

gctccaggtc ctggaggtga gcaccggcat caaccgtaat agcattcccc ttcagctgcc 

tgtccaggct ntgcaaaaag gctgccctc 



60 
120 
180 
240 
269 



<210> 134 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 134 

gatggatgag actgttgctg agttcatcaa gaggaccatc ttgaaaatcc ccatgaatga 60 

actgacaaca atcctgaagg cctgggattt tttgtctgaa aatcaactgc agactgtaaa 120 

tttccgacag agaaaggaat ctgtagttca gcacttgatc catctgtgtg aggaaaagcg 180 

tgcaagtatc agtgatgctg ccctgttaga catcatttat atgcaatttc atcagcacca 240 

gaaagtttgg gatgtttttc agatgagtaa aggaccaggt gaagatgttg acctttttga 3 00 

<210> 135 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<4O0> 135 

ggcgagcggg aacagctctt gaggagtgag actgcaggag atgtgggccg tgccaaagag 6 0 

atggatgaga ctgttgctga gttcatcaag aggaccatct tgaaaatccc catgaatgaa 12 0 

ctgacaacaa tcctgaaggc ctgggatttt ttgtctgaaa atcaactgca gactgtaaat 180 

ttccgacaga gaaaggaatc tgtagttcag cacttgatcc atctgtgtga ggaaaagcgt 24 0 

gcaagtatca gtgatgctgc cctgttagac atcatttata tgcaatttca tcagcaccag 3 00 

<210> 136 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 136 

gacttctaaa tatatcttgg atataatagg tgataagttc tgtcaattag taacatctga 60 

aaaaacagct ttgtcctggg tgaaaaagga tgccaaaatt gcctggaaaa gagcagtgag 12 0 

aggagtccgg gagatgtgtg atgcatgtga agcaacattg tttaacattc actgggtctg 180 

ccaaaaatgt ggatttgtgg tctgcttaga ttgttacaag gcaaaggaaa ggaagagttc 24 0 

tagagataaa gaactatatg cttggatgaa gtgtgtgaag ggacagcctc atgatcacaa 300 

<210> 137 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 137 
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ttgacaaatt gctggaacac acttattgtg gtttacccgg ttttaattat gtcagagatt 60 

gcatcatcct tatgcttgtt tacatctata atcttctatg aaatggtggt accaaggggc 12 0 

gcccaacagc ttttatcccc attcttagag catattcttt attataatga ttatccaaca 180 

tatttcttta attttaatac aaaaaataca tcatttaatt tttgttacat atgaacattc 240 

atttttaaat gctcagcctc aagtgcaggc atttttgagt ggcctgatta catattcctc 300 

<210> 138 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 138 

ggaaggggag ggttggtgag tcccagacct taaaaataca aggttaagag ggaccccaaa 6 0 

gcaaaaaatt ccaacccttt tcctcccagt cattgaaaca ccaaaactat tataccggag 12 0 

ggtgtaatag ttttgctgcc cagttgtggt aggccagtag tggcctccca agatgcccat 18 0 

gtcctaatcc caggaacctg tcaaaattac cttgtatggc caaaggggct ttgcagatgt 240 

aatgaagtta aggatctttc gccaggaaga ttatcccagc ttgttcagga gggcttgatg 3 00 

<210> 139 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 139 

gacatcattt tcttattcta gtaagagaaa gtacacagat tcaactttag agaggacttt 6 0 

tttttttctg gagctaaatc aaggaaggat tatcacgtgg cctcccttga atataatttt 12 0 

gaagctgtga acagtaccat cagtaacatt ttatggacag ctctgatggt ttttatacca 18 0 

cggcactctt cttacctttg ggggaagcta tctggagtta tgactgatgt gtaaagtggt 24 0 

ttactgttag aatcctggtg tgctaggatt ctgggagagt cactttcagg aagttacctg 300 

<210> 140 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 140 

gctgcccagg cagttttatg gcctcctggt tgtgtgcctt cacacccgcc tacagcccca 60 

cctcaccatc aagcgctgag ccaatgcggg tgtggctggc cctgagttcc tgagtcagct 12 0 

ccttgccagg gccagagctg gtaacagcgg ggcagcaggg tgggtagcct ctaccagcca 180 

gggcagtccc tgaggggcca gcaggggggc tgactgccta gtggctcaac ctcctgaacc 240 

cacccactcc cagcgatgct acccagaacc ccaacggcat gaatcctgca cagtgccggg 300 

<210> 141 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 141 

cccaaactta tcgggggtgc cagaggcaga gtagacaagc cttagtggcc gccatttgtt 6 0 

gaatatctac tgtgcgccaa gcagtgcgtc acaactttat gaagtaggta ttattatcat 12 0 

ccccatttta caggtgaaga aactgagtct ctgagagacc aacttttcca aggtcacaca 18 0 
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gaggtgggat ccagcccact tccgtctgac cccaagcccc tgctgttaac ccctgcccca 
ttgtggggag gttccggccc actctggagt tctctggtct gcgtcagtcc tcaggagaag 



240 
300 



<210> 142 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 142 

gaaaggtggc gcgcttctca cggctgagtt gctgcgcctg cagacggaag ctccccacag 6 0 

gcagagctgc ttggatgtgt gagtcatgaa gccagagaag ccccgctcca tgagcagtga 12 0 

ctccccaggc cctgtgacct ccctcctgtc ttgcagctcc tcctggcacc agtccccagg 180 

gctctcctgt tggtagttcc tgcttttctt cttggaaatt cctcgtggac ctcgagatct 240 

ttaccctaaa atagttctgt tgaatttcac cctggcaatg taaattgata gcttatcttc 3 00 

<210> 143 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 143 

cttggccttg cttctctgag aaaactttgg tcacacctcc agagccaggg tgggtgcctc 6 0 

cctggaggag ggggctttcc tggttggtgg cacagcagga gtccaggctt tgtaccgtgg 12 0 

acaccatggg ctatggcaac accttcctca ccatccttcc atgaggacct cgggagagag 180 

tggacatgaa accctttgtg ctctgaagca ttcaacagaa gctttctggt tctgtgccta 240 

tttctttggc acttgagcgt gtttgcaggt tcattacaca catgatgaaa gctctggccc 300 

<210> 144 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 144 

cctgactgag tgcctgacgg tggaccccct cagtgccagc gtctgaaggc agctgtaccc 6 0 

taagcacctg tcacagtcca gccttctgct ggagcacttg ctcagctcct gggagcagat 120 

tcccaagaag gtacagaagt ctttgcaaga aaccattcag tccctcaagc ttaccaacca 180 

ggagctgctg aggaagggta gcagtaacaa ccaggatgtc gtcacctgtg acatggcctg 24 0 

caagggcctg ttgcagcagg ttcagggtcc tcggctgccc tggacgcggc tcctcctgtt 3 00 

<210> 145 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 145 

gccagagcct agaggagaga tcaaagaccc tggccgaagt gaagcccatt ctgcaagcaa 6 0 

ctgggttccc atggcatgtg gtggccttag aggaggtgtt cagcctgcca ccgtcggtgc 12 0 

tttggtgctc tgcccaggag ctggtgggat ccgagggggc ctacaaggcg gccgtggaca 180 

gcttcctcca gcagcagcat gtgctggggg ccgggggtgg tcctggnccg actcaagggg 24 0 

annnnnnnnn nnncncaacc cccgctggac ccccngaanc tggcaagacc ngctgcccct 3 00 
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60 



<210> 146 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 146 

tgactttgta cctggtccaa gctgttgggg aattgctgct gttgacccag gcaggagtct 

gactagagaa caaactaagg ttgctgcaac aaacaaggac ctcttccaag aagggctccc 12 0 

aggcctggcg cagtgactca tgcctgtgat cccagcactt gggaggccga ggcgggtgga 180 

tcatttgagg ccaggagttc gagaccagct tggccaacat gatgagaccc cgtctctatt 240 

aaaaatacaa aaattagcca ggcgtggtgg cgcctgtagt cccagctact caggaggttg 300 

<210> 147 
<211> 295 
<212> DNA 
<213> Homo sapiens 

<400> 147 

ggnaangcna nngnaggaga nagagaagna ncagtnnagn cccangaaac ccnntgaaac 
ccttagaagn cagaggagng aaaggangaa aaananggnn ggangagaac nnannnnggn 12 0 

caaannaagg anganngnta ggngngaaaa anaanaacaa anggggaaaa ngggaaaaaa 18 0 

ggcganaaag gnaanannag nanaaggngg aananannnn annagaaagg ncaanaaaag 24 0 

aagnacaaag aaaaangana anaagnaann annanannga cagagacaag aagga 



60 



295 



<210> 148 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 148 

cgctgtgctt gagaccaacc tgacgggtac cttctacatg tgcaaagcag tttacagctc 
ctggatgaaa gagcatggag gatctatcgt caatatcatt gtccctacta aagctggatt 
tccattagct gtgcattctg gagctgcaag agcaggtgtt tacaacctca ccaaatcttt 18 0 

agctttggaa tgggcctgca gtggaatacg gatcaattgt gttgcccctg gagttattta 24 0 

ttcccagact gctgtggaga actatggttc ctggggacaa agcttctttg aagggtcttt 

<210> 149 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 



300 



60 



<400> 149 

agtgtcagtt ttcctaatct cagtccaggt aggaattaag aaatatctca agtgttgatg 

ctatccaagc atgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 12 0 

aatatctttt cttttgagag tacccccagt ttatttctac tgtgctttat tgctactgtt 180 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc atagcttttt aggacttggt 24 0 

gttaaaggag ccagtggtct ctctgggtgg gtactataat gagttattgt gacccacagc 3 00 

<210> 150 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 150 



tgtagacttt atgtcagttc tgtgtagact ttatgtcagt ttttgtcatt atttgaaaat 
ctattctgac aactttttaa ttcctttgat cttataagtt aaagctgtaa caactgaaat 
tgcatggatc aagtaagcat agttttatcc agggagaaaa ataaaaggaa gccatagaat 
tgctctggtc aaaaccaagc acaccatagc cttaactgaa tatttaggaa atctgcctaa 
tctgcttata tttggtgttt gttttttgac tgttgggctt tgggaagatg ttatttatga 



60 
120 
180 
240 
300 



<210> 151 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 151 

gcgggcccgg ccagcggaag cccctgcgcc cgcgccatgt caaagaaaaa aggactgagt 6 0 

gcagaagaaa agagaactcg catgatggaa atattttctg aaacaaaaga tgtatttcaa 12 0 

ttaaaagact tggagaagat tgctcccaaa gagaaaggca ttactgctat gtcagtaaaa 180 

gaagtccttc aaagcttagt tgatgatggt atggttgact gtgagaggat cggaacttct 240 

aattattatt gggcttttcc aagtaaagct cttcatgcaa ggaaacataa gttggaggtt 300 

<210> 152 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 152 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 60 

atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 12 0 

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 18 0 

gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 24 0 

cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 3 00 



<210> 153 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 153 

cagagaccag ccttctccag aggctgtcac tgcaggagcc gtgggcctgg gaagacttgg 6 0 

aagcggcctc tctcaactgg tttctgtctc cgtggagctg gaactgcctg cacttgcctt 120 

cagagggagg cacagtccac ccagatccac ctttccagca agacccccag tggctgccca 180 

gcctgggagc acctctttgc ttttcacacc aaaccaaaac tggcgagagc ccctcctagc 240 

caccagtgat ccccaagcat ccagtacaga accaggcatc gagctagctc cctgcacggc 300 

<210> 154 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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<400> 154 

cttgacctct gtactttaaa gaaatcacta accaaatttt caaagtttcc ttttaaatgc 6 0 

gtttagctag aaatctatgt atttatccct ttcctatttt gcattcttct cccactattt 12 0 

ttaaaaactc atttacagta gaaaccattc ttctttctcc caacagtatc ctttgccaag 18 0 

accatgagaa cagtatggga gcatgttgtt ggtcagggtt tcagaatacg cgtgatgtca 24 0 

ctgagaatgt ttgctcacag tcaataattg tctttgtgga tgtgataatt ttggagatac 300 

<210> 155 

<211> 81 

<212> DNA 

<213> Homo sapiens 



<400> 155 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 
ggggaggtgt gggaggtttt a 



60 
81 



<210> 156 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 156 

ggcagcacaa gtgtgcaaac agctatggaa agtgaactcg gagagtctag tgccacaatc 6 0 

aataaaagac tctgcaaaag tacaatagaa ctttcagaaa attctttact tccagcttct 120 

tctatgttga ctggcacaca aagcttgctg caacctcatt tagagagggt tgccatcgat 180 

gctctacagt tatgttgttt gttacttccc ccaccaaatc gtagaaagct tcaactttta 240 

atgcgtatga tttcccgaat gagtcaaaat gttgatatgc ccaaacttca tgatgcaatg 3 00 

<210> 157 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 157 

ctggtgagga gtctttgcga gagcgaggag cagcggttac tggaacaggt gcatggcgaa 60 

gaggagcggg cccaccagag catcctgaca cagcgggtgc actgggccga ggcgctgcag 120 

aaacttgaca ccatccgcac tggcctggtg ggcatgctta ctcacctgga tgacctccag 180 

ctgattcaga aggagcaaga gattttcgag aggaccgaag aagcagaggg cattttggat 24 0 
ccccaggagt cggaaatgtt aaactttaat gagaagtgca ctcggagccc actactgacc 



300 



<210> 158 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 158 

cgacagctct ccaatactca ggttaatgct gaaaaatcat ccaagacagt tattgcaaga 60 

gtttaatttt tgaaaactgg ctactgctct gtgtttacag acgtgtgcag ttgtaggcat 120 

gtagctacag gacattttta agggcccagg atcgtttttt cccagggcaa gcagaagaga 18 0 

aaatgttgta tatgtctttt acccggcaca ttccccttgc ctaaatacaa gggctggagt 24 0 

ctgcacggga cctattagag tattttccac aatgatgatg atttcagcag ggatgacgtc 300 
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<210> 159 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 159 

agtacccaga gttgcgagga gttttttaac tgatttagcc aggtggcaat catgagtgaa 6 0 

tggatgaaga aaggcccctt agaatggcaa gattacattt acaaagaggt ccgagtgaca 12 0 

gccagtgaga agaatgagta taaaggatgg gttttaacta cagacccagt ctctgccaat 180 

attgtccttg tgaacttcct tgaagatggc agcatgtctg tgaccggaat tatgggacat 24 0 

gctgtgcaga ctgttgaaac tatgaatgaa ggggaccata gagtgaggga gaagctgatg 300 

<210> 160 

<211> 294 

<212> DNA 

<213> Homo sapiens 

<400> 160 

ctttgagcta ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt 6 0 

ggaaatgtga tct toccata tcatcaagaa acttgttttc tggatgaata ctgggagaat 12 0 

aaaatgagaa ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac 180 

agaaggtata attttttgac accctttccc acctggtgcc tatgctaggc ttgtnctgat 240 

aacatccctc actnactnga tnntcacatn gnncttncnc tgangtccca tttt 2 94 

<210> 161 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 161 

cttcctcaaa gcatggttgc tgagtaccca gagttgcgag gagtttttta actgatttag 6 0 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 12 0 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 180 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 24 0 

tgtgaccgga attatgggac atgctgtgca gactgttgaa actatgaatg aaggggacca 3 00 

<210> 162 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 162 

gccccgtgtg gggagacgga cagcaccctc ctcatctggc aggtgccctt gatgctatag 6 0 

cgcctcccct ctcccctcag agggcacagc tgcaggcctg accaaggcca cgcccggctc 120 

tcgtgctcta ggacctgcac gggacttgtg gatgggcctg gactctccag aaactacttg 180 

ggccagagca aaggaaaacc tcttgtttta aaaaaatttt tttcagagtg ttttggggag 24 0 

gagttttagg gcttggggag agggaggaca catctggagg aaatggcctt ctttttaaaa 300 

<210> 163 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 163 



aggtttcctc 


tggtctgtgg 


60 


acccgaggct 


tgggagacca 


120 


tttctctcct 


gcttccctga 


180 


gatgatgtgc 


tagtcctggg 


240 


tctgccactg 


tggacagggt 


300 



<210> 164 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 164 

aaaatttata ngtaatgaca aatgacttat cagtgttcat catctgaaag ctaagtggtt 60 

cgttcaatca ctttttcaaa gttgatagta gattgcatgg tttcatgttt cctcatattg 120 

gtttattaat tctatttaat caaggaaaat aacttcagat tccataaagt ttcagtttat 180 

ttttagttta ctactaggtg agatagcaca ttacatactt ttactatcaa atattatttt 24 0 

agcagcttcc catagtacca aatgatttga ttccctactc tcatttttta aagcatataa 300 

<210> 165 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 165 

ctggactctg agtcgtcttg gtcccaggag ccagtagtga aggcaacagt ctgcccacct 60 

gtggacacca gatcctggga gctcctggtt agcaagtgag atctctggga tgtcagtgag 12 0 

gctggttgaa gaccagaggt aaactgcaga ggtcaccacc cccaccatgt cccaggtgat 180 

gtccagccca ctgctggcag gaggccatgc tgtcagcttg gcgccttgtg atgagcccag 240 

gaggaccctg cacccagcac ccagccccag cctgccaccc cagtgttctt actacaccac 300 

<210> 166 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 166 

cttctgttga ttggtttgtt taaagtacct aagtactacc ctttgactcc ctaccaaaag 60 

ttcttttgtt ttttaaacaa cttttatttg tgacttactt tcttgagaag tgttcttaat 120 

gaattgcata aaatagtggt agcagcttat ttcttaagta ctttattatt tgtgctttac 180 

catttcaggt tcttatcttt aacccttatt tactcagttt tccatctgaa tgatcctatc 240 

tctaaattaa ggatttaata aatgctgcaa attgtccact ttgcaaattg tccaaaagct 300 

<210> 167 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 167 
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gcgagatgaa gctacactgt 
ttaggaaccg gggcaagggc 
gtcagccgcc ggtccgagcc 
gctatgagct aagggagaac 
ccactgttcg gttaaaggag 



9 a ggtggagg tgatcagccg gcacttgccc gccttggggc 
gtccgagccg tgttgagcct ctgtcagcag acttccagga 
ttcctgctca tctccaccct gaaggacaag cgcgggaccc 
attgagcaat tcttcaccaa atttgtagat gaggggaaag 
cctcctgtgg atatctgtct aagtaaggat tccatatggc 



60 
120 
180 
240 
300 



<210> 168 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 168 

gtctgggcag cctacgcttt ccggataaaa atggcagaat gaaagaatta tgagtggaac 6 0 

tagagaatag gaaagacatg aaccaacgcc caaaatgaga aagaaggaca tataaagaaa 120 
aagacaaata caagtgaaaa aaatatacta atggattaac gtccctgtcg agtgacattt 18 0 

tctgactatg gaaatgatat tagacaaaaa gcaacttcaa gtgggtttct tatttgagtt 24 0 

caaaatgggt cataacgcag catagataac ttgaaacatg aacagcgcat ttggcccagg 3 00 

<210> 169 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<400> 169 

gagatctctg ggatgtcagt gaggctggtt gaagaccaga ggtaaactgc ggaggtcacc 6 0 

accct caeca tgtcccaggt gatgtccagc ccactgctgg caggaggeca tgctgtcagc 120 
ttggcgcctt gtgatgagee caggaggacc ctgcacccag cacccagccc cagcctgcca 18 0 

ccccagtgtt cttactacac caeggaagge tggggagece aagecctgat ggccccgtgc 24 0 

ectneattgg gncccctggc tanttcancn agncccncag gtngagncca aageca 296 

<210> 170 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 170 

gggtgttgga gcagattgta gttgatccac agcaaagagc atcaccaaag ccattccagg 6 0 

aggaactaga tccaccactt cctctgctgg gcatgctcca aaaatggttg tggcttccag 12 0 

agaggactcc aaaagaaagc acaaaaacta gacagtggga gggcataccc aaaagccctg 18 0 

agtttctgaa aaaatattga aagtttctat ggtgaaatag gaagttaatg tgcttaggaa 24 0 

gaaaaaagtg gtaatgattc aaggaaacat aatcacacac ggttttagtt ttaatggaca 300 

<210> 171 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 171 

atggaggcac cagcaggtag tggcccctgt aagcagggee agagteggga caaagagcag 6 0 

gagtgaagca gecaagagae agaggaccag getggageca gtgggcacgc aggagectge 12 0 

ctgggaaaag ceggggggea aggctggcat gggaatgaac acctgctggt gacacctctc 18 0 
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tgagcttcag ttcccttaac tagaaaaata gaacaggccc ggtgcggtgg ctcatacctg 24 0 

taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcaggag atcaagacca 300 

<210> 172 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 172 

ggcggaggag cagaagctca agctggagcg gctcatgaag aacccggaca aagcagttcc 6 0 

aattccagag aaaatgagtg aatgggcacc tcgacctccc ccagaatttg tccgagatgt 12 0 

catgggttca agtgctgggg ccggcagtgg agagttccac gtgtacagac atctgcgccg 18 0 

gagagaatat cagcgacagg actacatgga tgccatggct gagaagcaaa aattggatgc 24 0 

agagtttcag aaaagactgg aaaagaataa aattgctgca gaggagcaga ccgcaaagcg 30 0 

<210> 173 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 173 

gtctttccca ttcacttctc tagaaagctg ccaagacaga ggcagaaaga aatggatgat 6 0 

agttctgtca agcacacttc tgttctctta gaacttagaa gtgtttctaa gagaacagaa 12 0 

gtaataagag aaacagttac gtgtggaatt caacatcttt ggttggaacg cattggcttt 18 0 

ttttttcttg ttttgataga aatggaatta agcaaaagta gtttttgtct tttctgttgt 24 0 

cttcaaattt caggccatct atttttaatt taatcccgtt caagtacttg attgttatac 300 

<210> 174 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 174 

attatttcca aagcagccta cagtagaaaa tagtcattat ggcagcagct tctgatgttt 6 0 

ttgtttggta ggttttctga tttcaatata tagaatcata ttcatagagt atcttctttt 120 
aacgaattgc acaaagtacc catttaaaat ttacatgcac agttcattgc cacctttctt 18 0 

aggcctatgc atagttaata aggttataat ctactcaaca tggaaaatgg agcctatttg 24 0 

caaacacaca agtaattaaa gtaccaattc tctcttagtt tcttttttta tagttggttt 300 



<210> 175 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 175 

tgganactct ttantatgga aggtgaattt cctgtcaaca tagtccagga caaagcagtt 6 0 

ccaattccag agaaaatgag tgaatgggca cctcgacctc ccccagaatt tgtccgagat 12 0 

gtcatgggtt caagtgctgg ggccggcagt ggagagttcc acgtgtacag acatctgcgc 18 0 

cggagagaat atcagcgaca ggactacatg gatgccatgg ctgagaagca aaaattggat 24 0 

gcagagtttc agaaaagact ggaaaagaat aaaattgctg cagaggagca gaccgcaaag 3 00 
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<210> 176 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 176 

tataaacttt attttattct cttctggggt agagttacat gacaagaaat tgaattaatt 6 0 

caataaaatt ttagttcggg ttgcttaggt ttttactgct cccattcttg cttttactaa 12 0 

tttatccaag attagatgtg attactattt aataataatt tagtcctcac acttacaaac 180 

cacttacaat accagcatgc ttctatcact gtaattctat tcaattctca ggcccatgag 240 

gcatgccagc cagacgacca gacagcattt atagagaggg cactcaatac cagccacaaa 300 

<210> 177 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 177 

gactggagaa gtcagaagta gaaaagcaga ttgctaggag agacaggatg acagattttg 6 0 

gtcagaaaat gggatattgg agtttaaagt atcaaataca gaatagttcc agatgttcag 12 0 

agatccagca tgggattagg tactgaaatg gattagaact aaaagtcact agaatttaga 18 0 

aattgagaac catgagagtg gatgcaatga cttgttgctt gattgaaaaa taaattaata 24 0 

ataataaagg accatgagac tagcctgtta tagggggtat ctccatgann nttgtttttc 300 



<210> 178 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 178 

tcctggtgtc aaacactata aacctttgac cagctgagct gtgactgctg tcacatatct 6 0 

gagtcctgtg tgcacagtaa tatcctgggt caggtaaaat ccaggtcttc aagttttaag 12 0 

gattttttga agaattcggg cttctttaag acgatccatg cccaaatcca caagcttgtt 180 

gacagtggat tacagtttgt gtggcaaagt ccaagttgtt acactgtgct ttaaaaaaaa 24 0 

tcttatctgc atgtattgtt aacttagaga ccatgagatc tatttatcag gaccaggaag 3 00 

<210> 179 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 179 

ctcatgcctg taatcccagc actttgggaa gcagaggtgg caggatcatt ccagcccagg 6 0 

agttcaagac cagcctgggc aacacagtga gtgagaccct gtctctattt aagaaaaaat 12 0 

aattaagaaa ttttattaaa aaagaagaat caggaaacca agtccaaccc aactaaacct 18 0 

caaatgaacc agcccctaac acagatgagg ggatttggga ctgataagct ctgtgctgtg 24 0 

tccatggccc gtcatttatc aaggctgcag ctttgtaaat gtggctattt ttatgttgtg 300 

<210> 180 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 180 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 12 0 

gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 

gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 24 0 

ctcttgtaag gcaggaggat agaagaggac caagaaggga gggagctgcc ctgtggtgca 300 



<210> 181 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 181 

cccatgccgg gatcttccca cacccgtcct cacagatcca gccccagccc cttgcttccc 6 0 

aggccatctc tcagcagcac ctgcaggatg cgggcacccg ggagtggagc cctcagaacg 12 0 

catccatgtc ggagtctctc tccatcccag cttccctgaa cgacgcggct ttggctcaga 180 

tgaacagtga ggtgcagctc ctgactgaaa aggccctgat ggagcttggg ggtgggaagc 24 0 

cgcttccgca cccccgggcg tggttcgtct ccttggatgg caggtccaac gctcacgtta 300 



<210> 182 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 182 

tttgcagtgt tgtcagaaac aaataataaa gccccaaaag attaactagt tgaaaaaact 60 

ggcaaaatct gtatacgtgg aaatttacca ggacagagac tgaagaataa agaaaatgag 12 0 

tttcattgcc agatcatgaa atccaaagaa actttaaaga agatgagttg tgtaaatgga 180 

actgaaggga gggaagagct gccttcgcct gggacaaaga aaacatgtgt atacacatgg 24 0 

gtcaagcagt gctggtctgt ggctgcctgt ccagaggaat ggaaatatcc cttgtcttta 300 



<210> 183 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 183 

cggacccatc ggagcgtaac ctggatctcc gcaggcctgg cggaggccgg ccacctggag 6 0 

gggcattgct tggttcgcgt ggtagcagag gagcttgaga atgttcgcat cttaccacat 120 

acagttcttt acatggctga ttcagaaact ttcattagtc tggaagagtg tcgtggccat 180 

aagagagcaa ggaaaagaac tagtatggaa acagcacttg cccttgagaa gctattcccc 240 

aaacaatgcc aagtccttgg gattgtgacc ccaggaattg tagtgactcc aatgggatca 300 



<210> 184 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 184 
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ctgttttgca gatgaggaaa ctgaggtaca gaattcttag ggaacttacc caaaatggct 6 0 

tttctgcact ctgccctttg gtattgtccc atgtgaattg tttaaaactt atgtgtatag 120 

tggcatgagt aggtgatttc agaaacagaa ctcacttttg ttgtttggtc ttaaaattag 180 

gaacttttct tcatctgggc ttcatttccc tgcaccttcc cagctttcta gtcatgcaag 240 

ccacatgtct ccacgtgagg ggttcattgg aaagcagcca cagagccacc ccctggctgg 300 



<210> 185 
<211> 260 
<212> DNA 
<213> Homo sapiens 



<400> 185 

attatagaga ttaatctcct ttgctcgaag tctatttaaa tattagtcac atctaaaaca 60 

tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 

cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 

cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 

ctcccacctt taacatagtt 260 



<210> 186 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 186 

gataaactct tcagtgacga atattagaaa aagttagtta tacatttgag gaaaactata 60 

aaagtaccaa taatgagtag gaaatcactt ctgcagtatt tttggagcat tttccttaag 120 

catgacataa aagccaaagg tcacaaggga aaaaactgat agatttgtct gtgatattga 180 

gagatgtatg cacatataca tacaacagtc atagtaagac accgttagac aaaaggtgat 240 

gtatgaaaaa gaggcaaaac aacaagaaga aaagattgaa aaaatgagag ctgaagacgg 300 



<210> 187 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 187 

aaaaagtaaa gcttttcatg agcacaaatc ccttgcattg tttgatgtta ctgatattcg 6 0 

taaaatgaat attttttgtt ttgttttgtt ttattttttt gagacaagtc ttgctttgtt 120 

gcccaggctg gagtgcaatg gcatgatctt ggctcactgc aacccctgcc ttgcgagttc 180 

aagtgattct tctgcctcag cctcctgagt agctgggatt acaggcgctc accaccacac 24 0 

ccagctaatt tctgtatttt tagtagacac agggttttac catgttggcc angctggtct 300 

<210> 188 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 188 

gagcattcct cctttgttaa cgaagcaaca tttacacaag atggacatta cattattagt 6 0 

gcatgctctg atggcactgt aaagatctgg aatatgaaga ccacagaatg ttcaaatacc 12 0 

tttaaatccc tgggcagcac cgcagggaca gatattaccg tcaacagtgt gattctactt 180 
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cctaaaaacc ctgagcactt tgtggtgtgc aacagatcaa acacggtggt catcatgaac 24 0 

atgcaggggc agattgtcag aagcttcagt tctggtaaaa gagaaggtgg ggactttgtt 30 0 

<210> 189 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 189 

ctaatatcca gaatctacaa agaactcaac aagaaaaaaa ccaaccccac aagcgggcaa 6 0 

aggacatgaa cagacatttc ccaaaagaag acatacaagc aacctaaaat aatctaaaat 12 0 

aatttttaaa aagaaaaaat gcttgacaga gttttgatag tacttagtaa aaagttatat 18 0 

ctagtggctt tttgtttgtt tgtttttgtt ttgtttttaa gaaatagtct ctgtttccca 24 0 

agctggagta cagtggcgca atcttggctc actgcaacct cgaactcctg ggctcaagcg 30 0 

<210> 190 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 190 

aaccactatg gaggcatgat tggtggccac tacactgcct gtgcacgcct gcccaatgat 6 0 

cgtagcagtc agcgcagtga cgtgggctgg cgcttgtttg atgacagcac agtgacaacg 12 0 

gtagacgaga gccaggttgt gacgcgttat gcctatgtac tcttctaccg ccggcggaac 18 0 

tctcctgtgg agaggccccc cagggcaggt cactctgagc accacccaga cctaggccct 24 0 

gcagctgagg ctgctgccag ccagggacta ggccctggcc aggcccccga ggtggcccca 300 

<210> 191 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 191 

gcggcgctga cccggccggc cccacacccg ctcttcctct tctttgccgc ggactccctt 6 0 

tcctgcctcc aagacctggt gtctcccact gtgagcccag ctgtcccaca ggcagtcccc 12 0 

atggacctag actcaccttc cccttgcctc tatgaacctc tgctgggccc agcccctgtc 18 0 

ccagctcccg acctgcactt cctgctggac tcaggcctcc agctccctgc ccagcgagcg 24 0 

gcctcagcca ccgcctcccc tttcttccgg gccctgctgt caggcagctt tgcagaagcc 300 

<210> 192 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 192 

gacagaccgt tgagaggacg tggaggcccg agagggggta tgcgcggcag aggcagaggt 6 0 

ggccctggga acagagtttt tgacgctttt gaccagagag gaaagcgaga atttgaaaga 12 0 

tatggtggga atgacaaaat agcagtcaga actgaagaca acatgggtgg atgtggagtt 18 0 

cgaacctggg gatcgggtaa agataccagt gatgtggagc caactgcacc gatggaggaa 24 0 

cccacagtgg tggaggagtc ccagggcacc ccggaagagg agtctccagc caaagttcct 30 0 
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<210> 193 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 193 

ctcaagaaag gagaagtttt tttgtatgaa attggaggaa atattgggga acgctgcctt 60 

gatgatgaca cttacatgaa ggatttatat cagcttaacc caaatgctga gtgggttata 12 0 

aagtcaaagc cattgtagaa gacttaacaa gctgcagata accatgtgga cttctgtcat 180 

aattcttgct gagtcaagag tgtaaataaa agaaatggca ggactcatat tattcagttg 240 

tacccaagta tttaaaaatg actctcttaa gccttaaaaa gtcatagatt tgtgctgctg 300 

<210> 194 
<211> 300 
<212> DNA 
< 2 1 3 > Homo s ap i ens 

<400> 194 

cagaagctta gtcatatttc aaaatgatca aatatcaaga aaaattctga gctgcataac 60 

ttgtataaag taattttcag tgattttttt catggttatg ataaaagaac tggattagca 120 

gaaactttta ccctgaatca agatttaatt tttctttgag ctcatcttaa ggatatcgga 180 

acatagggag caaacgatgg tgtggctgcc tcagtgcttg atttttaacg gttttgaaga 240 

gaatagttac atttcttctc ctagtaagaa ctaataaata cattaacaga aatgaattcc 300 

<210> 195 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 195 

ctctactaaa aatacaaaaa ttagctgggc gtggtggcac acacctgtaa tcccagttac 6 0 

ttgggaggct gaggcacaag aatcgcttga acccgggagg cggaggttgc agttagccaa 12 0 

gatcgccctg ctgcactcca gcctgggcaa cagagggaga ctctgtctcc aaaaacaaaa 180 

acaaaaactg ttagtgaagg ttccctggga cttttgatat tttaaaaatt gatcttatga 24 0 

ctaagtagat aaattcattg ccataatgag gctagctccc agataaacag cgtattttct 300 

<210> 196 

<211> 300 

<212> DNA 

<213 > Homo sapiens 

<400> 196 

tggatactga caatggtggc aggcatttca agccttttaa attagtactt tttgtcgtct 6 0 

tgcttattaa aattttgtta attttagcaa agaccaattg ttgtgataaa ctggtgtttt 12 0 

ttggatgctt caagcacacg ttaaccaatt ttttaattcc ccttttggtt cctcccattg 180 

ttctaaaata ggactttcat attattaaaa cctcaaaaga tgatccaccc aggatgaaca 240 

aagatcacca aggggaaaga aaacattttt tatctttaca gaaaacatgt taagattata 300 



<210> 197 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 197 

atccagatgg gatacctcta aacacgaaaa gaaagaagat tccattagtg aatttttaag 60 

tttggctaga tcaaaagccg agccacctaa acaacagtcc agccccttag taaacaaaga 120 

ggaagagcat gcaccagaat catccgcaaa tcagacagtc aacaaagatg tggacgcaca 180 

ggctgaagga gaagggagcc gcccatccat ggacttattc agggccatct ttgccagttc 240 

ctcagatgaa aagtcctcat cctccgagga tgagcaaggt gacagtgaag atgatcaggc 3 00 

<210> 198 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 198 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 6 0 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120 

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240 

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 3 00 



<210> 199 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 199 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 60 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 12 0 

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240 

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 3 00 

<210> 200 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 200 

agtagaaaaa tacaaagact gtgatccgca agttgtggaa gaaatacgcc aagcaaataa 6 0 

agtagccaaa gaagctgcta acagatggac tgataacata ttcgcaataa aatcttgggc 12 0 

caaaagaaaa tttgggtttg aagaaaataa aattgataga acttttggaa ttccagaaga 180 

ctttgactac atagactaaa atattccatg gtggtgaagg atgtacaagc ttgtgaatat 240 
gtaaatttta aactattatc taactaagtg tactgaattg tcgtttgccc tgtaactgtg 

<210> 201 
<211> 300 
<212> DNA 

<213> Homo sapiens 



300 



<400> 201 
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ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 18 0 

ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 24 0 

agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 3 00 

<210> 202 
<211> 281 
<212> DNA 
<213> Homo sapiens 



<400> 202 

ggccatggga cagttgcaac agcagttaaa tggactgtca gtcagtgaag gtcatgattc 6 0 

tgaagatatt ttgagcaaaa gtaacctgaa cccagatgcc aaggagttta ttccaggaga 120 

gaagtactga gccgagaaag ctttgaggaa gacttgtctg tccccacatc tggggatagt 180 

aatgcccaaa atggtggagc tgaagagggg gatggggcgg gcgaggggtg cacagcggga 24 0 

aggggagtgg tggtctcacg atactgtgac tctgagtaac t 2 81 

<210> 203 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 203 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 

cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 

tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 

ctccagccct tgctccactc atgcctgggg gcctggggct gagtggtatc cctacctggc 240 

ctccccctgg cctctgggcc tccagcgctg ggtttgtcga gtgagagaga gagaggagct 300 

<210> 204 
<211> 269 
<212> DNA 
<213> Homo sapiens 



<400> 204 

gcggactctc aggacgaaaa gagccaaacc tttttgggaa aatcagagga agtaactgga 60 

aagcaagaag atcatggtat aaaggagaaa ggggtcccag tcagcgggca ggaggcgaaa 12 0 

gagccagaga gttgggatgg gggcaggctg ggggcattgg gaagagcgag gagcagggaa 18 0 

gaggagaatg agcatcatgg gccttcaatg cccgctctga tagcccctga ggactctcct 24 0 

cactgtgacc tgtttcagga gcctcatat 26 9 



<210> 205 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 205 

ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 12 0 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 18 0 
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ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 
agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 3 00 

<210> 206 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 206 

gggattacag gcatgaccca ccgcgcccag cctgtaattt cttatacttg gtattttgta 6 0 

cttggattat gcttctgata cgctataatt atttatgtac atgttatttt tcttcaatag 120 

actgtgaact cttcgaatgt aggactccta gagctagata ctcaattatt ttttattaaa 180 

ttgaatgact tgaaactaca gatcctttat ttaaacttcc caaatttctg ctttatctag 240 

gcaactcttt aaattctttg atctcatgta gattccaaag gctgaaataa ttgagatttt 300 



<210> 207 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 207 

tcctgaagct cggggggctg caggtcctga ggaccctggt gcaggagaag ggcacggagg 6 0 

tgctcgccgt gcgcgtggtc acactgctct acgacctggt cacggagaag atgttcgccg 12 0 

aggaggaggc tgagctgacc caggagatgt ccccagagaa gctgcagcag tatcgccagg 180 

tacacctcct gccaggcctg tgggaacagg gctggtgcga gatcacggcc cacctcctgg 24 0 

cgctgcccga gcatgatgcc cgtgagaagg tgctgcagac actgggcgtc ctcctgacca 3 00 



<210> 208 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 208 

attccaaagg tttcaaagaa cttggtcata aatatgataa tgagaagaca aagtatttat 6 0 

attaaaacag tttagtagcc ttcagttttg tgaaaatagt tttcagcaca gaaactgact 120 
tctttagaca aagttttaac caatgatggt gtttgcttct aggatataca ctttaaaaga 18 0 

actcactgtc ccagtggtgg tcattgatgg cctttagtaa attggagctg cttaatcata 24 0 

ttgatatcta atttctttta accacaatga attgtcctta attaccaaca gtgaagcact 300 

<210> 209 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 209 

gagacagcag cccccaggga atgaagctga tgccagagtc agacccgagg aggaagagga 6 0 

gccactgatg gagatgcggc tccgggatgc gcctcagcac ttctatgcag cactgctgca 12 0 

gctgggcctc aagtacctct ttatccttgg tattcagatt ctggcctgtg ccttggcagc 180 

ctccatcctt cgcaggcatc tcatggtctg gaaagtgttt gcccctaagt tcatatttga 24 0 

ggctgtgggc ttcattgtga gcagcgtggg acttctcctg ggcatagctt tggtgatgag 3 00 



50 



BNSDOCID: <WO 9933982A2_I_> 




WO 99/33982 



PCT/US98/27610 



<210> 210 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 210 

gtaacgtgac acgtatttta cttcttttag taggcggaca cactttctta aagtggtaat 60 

acgtcatggc cctgctataa ggtagtagtt ctagaagact gtttatctaa taattcagac 12 0 

taaagctatt tatattgctg tgacaccacg tggaaaactt ttataattcc atcttatttc 180 

tgatgtatat gttttatttt ctctgccttc ataagaacta aaaaccaaag ttatttacgt 240 

gaaaacaaga tttttgtttg agttcattta cttgagatat gtttaaaaaa tccaccttct 300 

<210> 211 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 211 

gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 12 0 

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180 

aatttggaga aagagctcca ggagatggag gcacggtacg agaaggagtt tggagatgga 24 0 

tcggatgaaa atgaaatgga agaacatgaa ctcaaagatg aggaggatgg taaagacagt 3 00 

<210> 212 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 212 

gcctgctgct tcatgccgcc ggcgtcctgc tccacgtctc tgtgctgctg ggccctgcac 60 

tgtcggccct gctgcgagcc cacacgcccc tccacatggc tgccctcctc ctgcttccct 120 

ggctcatgtt gctcacaggc agagtgtctc tggcacagtt tgccttggcc ttcgtgacgg 180 

acacgtgcgt ggcgggtgcg ctgctgtgcg gggctgggct gctcttccat gggatgctgc 24 0 

tgctgcgggg ccagaccaca tgggagtggg ctcggggcca gcactcctat gacctgggtc 3 00 

<210> 213 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 213 

ggtatggttg gagtgtagga atgaatattc atgaaatgtt tcttattgct tttccttccc 60 

taattcatac aatgaatgta tttggaatac ttacatatta taaaataaac tatacctctt 120 

caagaggtat cctgttctgt aagatcagat gtttttattg caggtcaata taatactgcc 18 0 

agagacagaa aataccccct tatcagtccc ttagtgcctc tttctgtttg tggcatggtg 240 

agaaaaccca tgctgaaaag attgtacttt gtgatcccaa tcagagggag gagctaatct 3 00 



<210> 214 
<211> 300 
<212> DNA 
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<213 > Homo sapiens 



<400> 214 

ggaaagggcc ctaaaagaga tgaacaatac ccgtatcatg tggaatgaat tagaaaccct 6 0 

tgtcagagcc catatcaaca actcagagaa acatcaaaga gtcttggaat gtctgatggc 12 0 

atgcaggagc aaacccccag aagaggaaga acgaaagaaa cgaggaagaa agagggaaga 18 0 

caaagaggac aagtcagaga aagcagtgaa agattatgaa caggaaaagt cttggcaaga 24 0 

ctcagagaga ttaaaaggaa tcttagaacg tggaaaagaa gaattggctg aagctgagat 300 

<210> 215 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 215 

atacttttta aacctttttt ggcagctcag atggtgtaaa ttttaaaatt ttgtataggt 60 

atttcataac aaaaatatgt atttcttttt tgttatttta tcttgaaaac ggtacatatt 120 

ttagtatttg tgcagaaaaa caagtcctaa agtatttgtt tttatttgta ccatccactt 180 

gtgccttact gtatcctgtg tcatgtccaa tcagttgtaa acaatggcat ctttgaacag 240 

tgtgatgaga ataggaatgt ggtgttttaa agcagtgttg cattttaatc agtaatctac 300 

<210> 216 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 216 

gcagatattt actgaaggaa tctaggttgt attttcagtg gacaatggga ataaagcatt 6 0 

tctaaagcac cgactggaga ggaaggcaac agagacaagg agagaagccg agagacatgt 12 0 

ctgcgtgctg ccacgcatct gagcgattgc tctgtgaaga gttgtacact gaacattttc 180 

aggggaggct gtttacccag gcaatgtcct caaacaagcc tgtgccgggg agtcctggaa 24 0 

tctgtgccag gactgtgttt ttagcccttc acctctcagc tttagcagga catgaaccag 3 00 

<210> 217 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 217 

cccccatctt cactggttat tccacttatt taaaatgtcc agaataagca aatctccata 60 

tagaggaagt agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc 12 0 

ttttgcagtg ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata 180 

tgcttaagac cattgaatta cacactttac gttggtgaat tgtatggtat gtaaattata 240 

gttcaataac atagttacaa aagataatca aaagcatgaa agcactgttg atgtggtttg 300 



<210> 218 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 218 
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acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 

actcggcata catctgcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccttgacc ttgtactaca 



300 



240 



60 
120 
180 



<210> 219 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 219 

caactagaga agattggaca gcaggtcgac agagaacctg gagatgtagc tactccacca 6 0 

cggaagagaa agaagatagt ggttgaagcc ccagcaaagg aaatggagaa ggtagaggag 120 
atgccacata aaccacagaa agatgaagat ctgacacagg attatgaaga atggaaaaga 180 
aaaattttgg aaaatgctgc cagtgctcaa aaggctacag cagagtgatt tcagcttcca 240 
aactggtata cattccaaac tgatagtaca ttgccatctc caggaagact tgacggcttt 3 00 

<210> 220 
<211> 260 
<212> DNA 
<213> Homo sapiens 

<400> 220 

ggtaagtcag gtgattgaat cccggaaagg ttcattgtct tcaagctcac aatactattt 60 

tgggacaaac agttgtctag tgtttggact catgaaccct gattcttgag ggtggtattt 120 

tactgctttt gtgatttggt ttcaacatat atagtctttt ctccggagtt accttaggtc 180 

agtggccagt gtttcagccc ctggaaaggg catgggctgc cactgaggtt ggtcacaggc 240 

ctctcagctc atggtgggag 26 0 

<210> 221 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 221 

gggttccatc ccttccaccc aggaaatgga ggcacgactt gcagcgttgc agggcagagt 60 

tctaccttct caaacccccc agccggcaca tcacacaccg gacaccagga cccaagccca 12 0 

gcagacacag gatctgctaa cgcagctggc agctgaggtg gctatcgatg aaagctggaa 18 0 

aggaggaggc ccagtgaccc tccaggacta tcgcctccca gacagtgatg acgacgagga 240 

tgaggagaca gccatccaaa gagtcctgca gcagctcact gaagaagctg ccctgatgag 300 

<210> 222 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 222 

gcggtgaccc acgtgtcctg catgattgcc ctactgctgt ggagacctcg tgctgaccat 60 

ctggcagtgt tcttcgtatt ctctggcctg tggggcgtgg cagatgccgt ctggcagaca 120 

caaaacaatg ctctctacgg cgttctgttt gagaagagca aggaagctgc cttcgccaat 180 
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taccgcctgt gggaggccct gggcttcgtc attgccttcg ggtacagcac gtttttgtgc 240 
gtgcacgtca agctctacat tctgctgggg gtcctgagcc tgaccatggt ggccgtatgg 300 

<210> 223 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 223 

gccccctctg gatcctgagc tccctgctct agacagtgat ggtgattcag atgatggcga 6 0 

agatggtcga ggtgatgaga aacggaaaaa taaaggcact tcggacagct cctctggcaa 12 0 

tgtatctgaa gggggaaagc cctcctgaca gccaggagga ctctttccag ggaagacaga 180 

aatcaaaaga caaagctgcc actccaagaa aagatggtcc caaacgttct gtactgtcca 240 

agtcagttcc tgggtacaag ccaaaggtca ttccaaatgc tatatgtgga atttgtctga 3 00 

<210> 224 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 224 

ctgcggccgc aggagctgtg gcggttttcc taatcctgcg aatatgggta gtgcttcgtt 6 0 

ccatggacgt tacgccccgg gagtctctca gtatcttggt agtggctgag tccggtgggc 120 

ataccactga gatcctgagg ctgcttggga gcttgaccaa tgcctactca cctagacatt 180 

atgtcattgc tgacactgat gaaatgagtg ccaataaaat aaattctttt gaactatgat 24 0 

cgagctgata gagaccctag taacatgtat accaaatact acattcaccg aattccaaga 300 

<210> 225 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 225 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtagt tcctgctttt cttcttggaa attcctcgtg gacctcgaga tctttaccct 120 

aaaatagttc tgttgaattt caccctggca atgtaaattg atagcttatc ttcacagatg 180 

ccagacaatg gacaactcac catcagtcct ctgctcacct gagacaaatg catgtctgat 240 

tgcttcctct gccctattgt ttatgtgaaa atgcagattc actgagccag actaaggcat 300 

<210> 226 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 226 

tatataacaa cttttgcttt caaagttggg tgggactaga acacacaatg gaaggatgga 6 0 

gtcaggagac ctggattctt gtgcccgctc tggcttttac agtctgccta actctatgca 120 

gtcacttcct gccagcctgt ttccttacct acaagaggga gagacactcc ctggccagcc 180 

tagttctcag ggtgaacgaa aggtcattat cactgcatcc tctagtcatt tgcttcttcg 24 0 

ctaattaaca catcttgagc acctgcgatg ttccaggaac aggagatggc agcgtgcaag 300 
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<210> 227 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 227 

ttgctgaaat gggcacttct gctgtgatgc tagtgttgat tttgctctca gatgaacaca 6 0 

atgtctcata ctaaccaaga agcaagaaaa gccccatgca ttcatttttc acttggagtg 120 
acaatgggag aggtcaggaa tcaagttcac tttcaagatc taagggagtc cactatctgt 180 
gcaattgtat ttggcttttt tttgcactgt ttcaatgctg gtaattgaaa ccattttaat 240 
atatttggtt gtattcactt tatatgtcct tccaaaaatg ttgttgtgta cataccatgc 300 

<210> 228 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 228 

gctatgtatt gtgtcctacc atgaattcac tccatgctag ccacattggc ctgtatggct 60 
attccttgga cacacctagg atgttcttgc ctcttagctt gcctaccttt ctctcatcat 120 
ttgggcctca gcgaggatat catctcctca gagaagcctt ctgtgaccat gctatctaaa 180 
atactccagc acttcagtca ccctttatcc cattactctg ctttttcaga aacattggtg 240 
ctccctgaaa catatttgtt tacttgctta gtgtcttttc tcccgcacta ccatgtaagc 300 

<210> 229 
<2ll> 300 
<212> DNA 

<213> Homo sapiens 
<400> 229 

gattttcgaa actcttcagc tacttgccct tttttatctg aaaccatcat accttctgaa 60 

agaaaaaagc atatcttcat tgacataaca gaagtgagat ggcccagtct tgatacagat 12 0 

ggtaccatga tatatatgga gagtggcatt gtgaagataa catctttaga tggtcatgca 180 

tacctctgcc tgcccagatc tcagcatgaa tttacagtac attttttgtg taaagttagc 24 0 

cagaagtcag actcatctgc agtgttgtca gaaacaaata ataaagcccc aaaagataaa 3 00 

<210> 230 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 230 

acttcttgtt tgcctttttt ataaggaaat gttggagagt tacatcattg ctaatgtaga 6 0 

aatgttaagt ggaaaaatat acagtttggt aaaataaact agattctaca tttatttgtg 12 0 

ggtttttttc ccctcctttc tttccacagc acttttgata tcaagcaagt ggcttccttt 180 

ttgagatatt aaaaaaaaaa agaaaaggaa aaaagtaaat gannnnnnnn nnnnnaaccc 24 0 

tttctnattn gnattngttt nagnattgng aagttgngtt aaanagtnct agntagaaat 3 00 



<210> 231 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 231 

tgattctttt tgttnttttt tttgatattg acaaaagctt anncnttncn attaaaaang 60 

ccactaatta gactttttan ntaaaaaang tagggggttt taaaactact ttcctactac 120 

caaaaaatca naaagtatct agctttctaa atngggaaag caagcaatgt tataaaaacn 180 

ctgaaggaat ctctttcttc gggacctttt gttaaactcg gttnaagctg taaaccttat 240 

ttaaaataaa atttaccaca naacaggaaa tanaacctgg ggaanactcn aaatacncct 3 00 

<210> 232 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 232 
ggaagccaag gcctggagct gcaggtcccc 
atggcctggt gcccccacct gctgcagcag 
tgctggggtg gtcctcatgg acagtgaggt 
ggatcacctg ggttccaggc catccttgct 
gcagaaaagg ccagaccctg ctgagttaga 

<210> 233 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 233 
gaggaagagg cctgctccac ttgtctggga 
gcctggagct gcaggtcccc cggcatctct 
gcccccacct gctgcagcag gagccccaag 
gtcctcatgg acagtgaggt gtgcaagggt 
ggttccaggc catccttgct gagcatcttt 

<210> 234 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 234 
ggaacataat tagcttactg atttgatggt 
gtttgccttt ctttaactct ggctccttct 
attgagtgag gaggacaggc agaactgtgt 
gctcttggtt ttttgctcac ttttatatgt 
agtagaggat cagatcattg ggacttgttc 

<210> 235 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 235 
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cggcatctct ctctgtcccg 
gagccccaag gagtgctagc 
gtgcaagggt gcactgaggg 
gagcatcttt gagcctgcct 
ggctgctggg atccactgtt 



gcagcccagg 6 0 

tgagggtggt 120 

tggtgggagg 18 0 

tccggtggga 24 0 

tccacacagc 3 00 



acctgggcag gaggcacaga ggaagccaag 60 

ctctgtcccg gcagcccagg atggcctggt 120 

gagtgctagc tgagggtggt tgctggggtg 180 

gcactgaggg tggtgggagg ggatcacctg 240 

gagcctgcct tccggtggga gcagaaaagg 3 00 



tctgtgtagt tcctgaaact 
ccttcttctg tttgtgtatc 
ctgccaagga ccggatgtac 
aaggtattag tacaaaccta 
tggtttcaag aaagaattaa 



cttggctctt 60 

tgtttaattc 120 

ttctttcctt 180 

aaggagagaa 24 0 

caaattgccg 300 
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gttggctcaa gggccaccag aagcatttct ttattattat tattttttaa cctggacatg 
cattaaaggg tctattagct ttctttccgt ctgtctcaac agctgagatg gggccgccaa 
ggagtgcctt ccttttgctc cctcctagct gggagtgacg ggtgggagtg tgtgtgccca 
ggtgggggtg tctcctggct gggaaggagg gaaagggagg gagagttttg cgggggttgg 
ca 9tggagag caggctggag aggagatggc taatagctgt ttaatggaaa cctgctgggc 



240 
300 



60 
120 



180 



<210> 236 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 236 

gaatcatcga aggttgagac cgtgtctagt tacatagtta taaataccca tctatgtact 60 

gatgccttct aaatgtctat ctccagtatg gtcttttcct ttaagctcta gatccattga 120 

caccctcacc atctctaaaa ggcatttcaa actgaacaca tctgatacag aacttttcat 180 

ttccttccca actttgccca cgccagcctg ctcctccttc acgctttcca cttagtatat 240 

gatcccacta ttcactcagt ctctgaagct taaaacctag gattcatcct tgactactgt 3 00 

<210> 237 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 237 

caggacatgg agcagtacct gtccactggc tacctgcaga ttgcagagcg gcgagagccc 60 

ataggcagca tgtcatccat ggaagtgaac gtggacatgc tggagcagat ggacctgatg 12 0 

gacatatcgg accaggaggc cctggacgtc ttcctgaact ctggaggaga agagaacact 180 

gtgctgtccc ccgccttagg gcctgaatcc agtacctgtc agaatgagat taccctccag 240 

gttccaaatc cctcagaatt aagagccaag ccaccttctt cttcctccac ctgcaccgac 3 00 

<210> 238 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 238 

cactggctac ctgcagattg cagagcggcg agagcccata ggcagcatgt catccatgga 6 0 

agtgaacgtg gacatgctgg agcagatgga cctgatggac atatcggacc aggaggccct 12 0 

ggacgtcttc ctgaactctg gaggagaaga gaacactgtg ctgtcccccg ccttagggcc 180 

tgaatccagt acctgtcaga atgagattac cctccaggtt ccaaatccct cagaattaag 240 

agccaagcca ccttcttctt cctccacctg caccgactcg gccacccggg acatcagtga 300 

<210> 239 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 239 

atttcctcca gtcctgggcc ccatccttga gggccttccc agccagccag caggagaggc 6 0 

aagaactggg ggaacacagg aacctagggg aggaggggag cgctgggcat cctcaggctg 12 0 

gcggccaagg cctgcccctg gaggcactag aggagggcat ctgtctgtgg gagcccagag 18 0 
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gctgcaggga ggaggaggag ggaggtatct ggtgtgagcg ttgcccctgc gacatttggg 24 0 

accacacagg tgggcttcct tattccctga caaagcctct gtttccagct cttccgccct 300 

<210> 240 
<211> 274 
<212> DNA 

<213> Homo sapiens 

<400> 240 
catgagtgat attttggtct gggtttcctc 
aaatgttttt aatatacatt cttattttgt 
taactttttg gggggtcttt tttggtgttt 
tatgtatgta ttttcttttt tgtggagtat 
tactcctcag gttgatgtat tcatgaaagt 

<210> 241 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 241 

ctgttgcctg ccaagctcag ggcccattta tcatgcatct tcccatcctt gtctccccca 60 

actgtccctt acctgagtca caatttcgcc aaagccaaag ggattgtcct aagccaatgt 120 

tgatttatca ctcttcctgc tcaaaagccc ccaagatcac ctatcaatca cctacttgag 18 0 

tgcaagcttt gactctgtca cctgacattc aagtccccct ctgcccccat gccagtctta 24 0 

tcccctcccc tacatatgcc ctatgcctca gtttgccttc cctccacttt aaaaagcctc 300 



<210> 242 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 242 
ccgctggcta tgtggacgct ggggcagagc 
agtttgccag gcagctacag ctctctgatc 
tccagcagga taccaagggt ttggtggact 
tggatggggg caggagcctg gaagagctaa 
gcggtggtgg ggggtgctta gtggctatgc 

<210> 243 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 243 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 6 0 

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 12 0 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 18 0 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 24 0 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 300 



ttaagatttt agtttgtctg aattaaggaa 6 0 

cccacccctc cagaaataag ctggaaatct 120 

taatgggccc agaactgtgg tttaaatttt 18 0 

aaatttaaaa actggatttg ggacctaaaa 24 0 

ttta 274 



caggccggag tcgaatgatc agccaggaag 6 0 

ctcagacggt ggctggtgcc tttggctact 12 0 

tccgagatgt ggcccttgca ctagcagctc 18 0 

ctcgtctggc ctttgaggta atggggggtg 24 0 

tcaccccgct ccaggaggcc tattttggta 300 
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<210> 244 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 244 

agtaaatttt ttatgcatat tttattgcaa taaaaaatga aaacagtttc aatctaggag 60 

gattttggat gcatctatgc cttgagaaat gaatggtttg atgtaaatgc atggtagcaa 12 0 

gaataaataa ttatgttaat tcatataata tgttatatat agttttaaag aaaattctat 180 

cactgtcttc ctatgggtag ggctataatg tccagttctt tcagggatta agagggtagg 240 

gtctgaagtt aatccttgtt tgtcgtaatg ttattaattt attcaaccaa gacttaattg 300 

<210> 245 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 245 

tagacataga aaacatacag taagaatatg gtattataat cttacgggac cactgtcaaa 60 

tacgcggtct gtctttgaaa agttgtaatg cggcgcatga ctataaatac ctagctggtt 12 0 

agcatttaca ttccttgcca gggagtttga aatttatact atagaaataa ctttaggttt 180 

taggtagagt taaagaggta aagcacatgt tgccacaacc caggaaagta tttttaagaa 24 0 

agattggatt ttcctacctt tagagatcta aaaaaaattt aatataaaaa atcattttgt 300 

<210> 246 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 246 

tggaatattt gctgtgaagg gagaaaggga gagaaaactc ttctgaggat catttgtctt 6 0 

ggtagtatag taaaaccaac cagctgaacc tttcaggcta caagagaacc cgggtcggta 12 0 

atgtcttttt aagaataatt tttaattgct tataacaagc atattttgtg gcatttgaac 180 

tatatttact gctccaatat ccgttatttt ccaaaggatt ttgtatcttt ttgaaaatgt 24 0 

ttacatcatc agatgatcca cagaattcac tttatgtgag atctcccgag agtttccatc 300 

<210> 247 
<211> 300 
<212> DNA 
< 2 1 3 > Homo sap i ens 

<400> 247 

gtgttgctca gtgagcagac ccgactccag aaggacatca gtgaatgggc aaataggttt 6 0 

gaagactgtc agaaagaaga ggagacaaaa caacaacaac ttcaagtgct tcagaatgag 12 0 

attgaagaaa acaagctcaa actagtccaa caagaaatga tgtttcagag actccagaaa 18 0 

gagagagaaa gtgaagaaag caaattagaa accagtaaag tgacactgaa ggagcaacag 24 0 

caccagctgg aaaaggaatt aacagaccag aaaagcaaac tggaccaagt gctctcaaag 300 



<210> 248 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 248 

gagaggatca cttgagctta ggagttcaaa tccagcctga gccaacataa caagactttg 60 

tctctaaaca aaacagttat tgtttaaaga atctgaaatc ttcatcttta attcaggtag 120 

caatgaatcg agcccaagtt tgtttgatat ccagttccaa gtctggagag aggcatcttt 180 

atcttattaa agtatcgaga gacaaaatat cagacagcaa tgaccaagag tcagcaaatt 240 

gtgatgcaaa agggctatca aagggaggct ttttacagag aactaaggaa gagaaggagg 300 



<210> 249 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 249 

ctagcctggg caatatagta cgaccctgtc tttactaaaa atgcaaaaat taaccacgta 6 0 

tggtggctca cacctgtagt cctggctact gaggaggctg atgcaggaga atcatttgaa 120 

cccaggaggt caaggctgca gtgagctatg attgcaccac tgcaatccag cctggacaac 18 0 

acagtgagac cctgcctcac aaaaattata ttctgatttt ctgagtccat gaacacattg 240 

tccaaatgga tttttctagc tcctccaagt tacagatagt tccacgcaca cacagaactc 300 



<210> 250 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 250 

aggaaggtgg aggggcagga acaggacgga caggccccgg gctctggcac atcctgggga 6 0 

acaagggacc acaaggacgg gggcagtctc cagacttccc ctgggcgctt gaccccaggc 12 0 

cttgcagggg agagagccag ggcctccctc aggtctttgt tcatgctgtt ttccctgccg 180 

tggacaccct ttcccgctct ccgattctct aaatcctgcc ccatctccca gatcttgttc 240 

atgtccaagc ttttccagga agtcttagca gctcccacac cgcagagctc gagatgtctc 300 



<210> 251 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 251 

gaaggcagaa gtgtaaatga acatacagaa gaaggagaaa gcctgctgtg tttggcttgt 60 

tcagcagggt attatgaatt agcacaagta ttgcttgcta tgcatgctaa tgttgaagat 12 0 

cgagggaata aaggagacat aactcccctg atggcagctt ccagtggagg ttacttagat 180 

attgtgaaat tattacttct tcatgatgct gatgtcaact cccagtctgc aacaggaaac 24 0 

actgcgctaa cttatgcatg tgctggagga tttgttgaca ttgttaaagt gctccttaat 300 



<210> 252 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 252 
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gcacttctct ctcactggaa agagaactgt tctcctttct ctttcttctg cctattaagc 
ctctgctcct aaactcctca tgtgtgtctg tgtcctaaat tttcctggca tggcaggaca 
aaccccgggt atttaccaca gacaacaaaa ccgcttcact atgatgtatg catgctgcaa 
aggaagagac agaatcttgc tctatcaccc agctggagtg cagtggcacc attgcagctt 
actgcagcct caaactcctg gctcaaggga tccttcagct tcagcctcct ggttaactag 



60 
120 
180 
240 
300 



<210> 253 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 253 

gtctgatgca ggagaattgc taaaacccag gagggagagg ttacattgag ccgagattgc 6 0 

gccactgcac tctagcctgg gcgacagagc aagactccgt ctcgaaagaa agaaagagaa 120 

aggaaattcc ccagggaagt acctcggctt atttcataaa caggtactga aggaagcaga 18 0 

ggcatgtgga ggacttcccc acctcgtgca gctatttggg ccgtggcatc tgaaatttct 24 0 

tatttcagag tcaccccttt gatgaccttg gcagtgaact gcagtcatct gtttaggcct 300 

<210> 254 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 254 

atgttacaga catgaaatat gaacagaatg ctaaaagaac ataaaagaat aagagctcct 60 
taaagattat aaataaatgg tgatgttaaa gtaatagcac cattggacga agctagggaa 12 0 

tcaacacttg acagaaagat acatattttt tttatacaaa ctacatatat ttgagcaatc 180 
aagtagtaga catagagaat tttcttttta tggaagtact ctaataagta aagggctgat 240 
agaattatat cagcattttc tagctcctgg ggaattatgc attgggcatc catggctgct 300 

<210> 255 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 255 

gctgcctgtg gcatagccac tgctgtacgt ttttggttgt tnttaagaaa ctcgatgaag 60 

aggggtgtca ttctgggctc ggggtggttg ccaatttttc accagaaagg gagccacccc 120 

ttgcaaccac ttctgtctcc gttagccccc cctctgccct cctccaagcc aaagcgtggc 180 

ctggcttttg tcttcccatt tagttttcct cttttaccct tccttttgtg cttaatttat 24 0 

taaaatagtt gctgtataat ttattttcat aaactataaa aaaatactaa atggttaaaa 300 

<210> 256 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 256 

acagtctcgg gtttcatatt ttgctgtttt tgatggacat ggaggaattc gagcctcaaa 60 

atttgctgca cagaatttgc atcaaaactt aatcagaaaa tttcctaaag gagatgtaat 12 0 

cagtgtagag aaaaccgtga agagatgcct tttggacact ttcaagcata ctgatgaaga 180 
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gttccttaaa caagcttcca gccagaagcc tgcctggaaa gatgggtcca ctgccacgtg 24 0 

tgttctggct gtagacaaca ttctttatat tgccaacctc ggagatagtc gggcaatctt 300 

<210> 257 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 257 

atagaactag gcactgattt gtfctatattt atcctgctcg agacacatga tgtttcatgt 6 0 

atctgtggct ttttatagtt taaaataatt tctggaaaag tcatagtcat tatctcttta 12 0 

accgctccct ctcttccatt ctctttgttc tctcttcctc gaactcctgt tagtcatttg 180 

atcctccata tctctgaata tttttgtatt tcttttatta tttatttctt gtctctgcta 240 

cattttacat tgagtaaaag tgggatgtga cagtgggaaa tcattagtga cttagaaatt 300 

<210> 258 
<211> 285 
<212> DNA 
<213> Homo sapiens 

<400> 258 

tactctatta tattgtgcat gctcctgatt tagctgctct tggcatcatt ggtcgcagtg 60 

gaaccttgaa atgcatctgg ctagatttat gctcaaatca ttctcagtta gccttttagt 120 

gcctcttcaa aggttttttt ttgtatgttt tctattctta ataaaagctt aggattaatt 180 

agaaagaatc tgatatggtt atgtttcccc ttgtgtacgc tgacctcatt catacgtttt 240 

tcatagtcca gtggtctaaa cgctttcaag agcccagctc cttgg 28 5 

<210> 259 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 259 

gccttctctg gcctcaccaa ttaggtcaaa tgttccttag aatgtgttgt ggggcatggt 6 0 

ctctccctgt gaggacctgt ccagctggac ctccgccttc ctgcgactgt attggtgtct 12 0 

ttccctctca agcctatgag ctctgcaagg gcagggaccc tgtatgattt tgcctatcgt 180 

atgtcctcca gcccccagca cagcgcctgg tgtccagtga gagctcagca aatactttgt 24 0 

gagttaagga caggcggctg ggtagatgga tcgtctgcct agacagggca gttattcgct 300 

<210> 260 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 260 

gaaaagggag ccgcgcagcg cctacgggag tccggcggca gcagccggta ccggcaacca 6 0 

cgggcagctc tcagggaatc tccgtcgtga ggccagaggc tccagtcccc gcgagtccag 12 0 

atgcctgtcc agcctccaag caaagacaca gaagagatgg aagcagaggg tgattctgct 180 

gctgagatga atggggagga ggaagagagt gaggaggagc ggagcggcag ccagacagag 240 

tcagaagagg agagctccga gatggatgat gaggactatg agcgacgccg cagcgagtgt 300 
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<210> 261 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 261 

tttgctttca gtggttggct ttcactgaaa gaaagtgtaa aaaaagtcag aatttatagc 6 0 

tttcactatg tccaagacta ggactgggtt ataaagattt tcttttgtga aggaaaataa 120 

aagaaaattt gccactactg catttacttt actattgtaa acttaagatt cattccttag 180 

tctttggaat tttgatgtct caaaaccaga tgagtggaag tgctgaattt gcaaaataaa 240 

gctaagaatg cttaactctg cactttaagt tctactctga ccaaattgaa gatgagcaga 300 

<210> 262 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 262 

ttttttaaga gataaggtct tgctatgtta tctaggctgg cctaaacttc tgggctgaag 60 

tgatcctcct gtgtagctgg gactacaagc atgtgccacc aatgcctggc ttctcacact 120 

gttttgtaac atagatatgt gaagatgtgt attatagaat tgtttgtaat actgtagtgt 18 0 

tgtaggcaat gtgactgtct atagggaagt ggacaggtta tttgtggtaa atactcatgg 24 0 

aaaacggtca agcagttaaa agcaatcaat tatggtcacc cagcaatgca gataaatctt 3 00 

<210> 263 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 263 

agaacaggga gaagagagga agagggagct gcaggtgcca gaagagaaca gggcggactc 6 0 

tcaggacgaa aagagtcaaa cctttttggg aaaatcagag gaagtaactg gaaagcaaga 120 

agatcatggt ataaaggaga aaggggtccc agtcagcggg caggaggcga aagagccaga 180 

gagttgggat gggggcaggc tgggggcagt gggaagagcg aggagcaggg aagaggagaa 24 0 

tgagcatcat gggccttcaa tgcccgctct gatagcccct gaggactctc ctcactgtga 3 00 

<210> 264 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 264 

ttaaaggtag ttttagaagg aagtacaaat tggctttcat cttgcaaaca atcgtttttt 60 

acttcattat cttaatttgc tttgtcactc ataaaaagga aaccatacct gagttgtaga 120 

caatgaggaa acacttgagg cttctgctgt gtgttctttt gttattgttg ttattgttgt 180 

tactcagtaa cttgaatatt gtttaatgtg ttgtaagacg tagagtttat ctcaagctgt 240 

taaaaatggt aatgtacaaa tgtgaataga cacttatcta tataatatgg gtaagttttg 300 



<210> 265 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 265 

caggaaagtc ttcctagagg taatttttaa gctgattgtt ttagaattag tagaagcttg 60 

ccagatggaa aagtccaggc aaagtgtaac atgaatggga aaggccacag tctagaaatg 120 

gcagagtgtg ttcctagttt gtttgtttgt ttgtttgtac ctgccttgtt ccaggaagga 180 

tttaatgtgg tttatattcc agtcctttaa tgctggaagg gctgagatga gactgaaaga 24 0 

tgggcaggaa gtatatcatc acaagctttg tgtttgatgt taatgtgtat gatttttata 3 00 

<210> 266 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 266 

tgtgccacca cacccagctc attattatta ttattattat tattattttg agacgaagtt 60 

tcactcttat cccccaggct ggagtgcaat ggtgcgatac tggctcactg caacctctgc 12 0 

ctcctgggtzt caagcggttc tcctgccttg gcaggcacct gtagtgtcag ctactcgaag 180 

gctgaggtgg gagaatcgct tgaacctggg gggcggagat tgcaatggtg tggtctcggc 240 

tcactgcact cgagcctggc gacagagcaa gactctgtct caaaaaaaaa aaaaaaaaan 300 

<210> 267 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 267 

atataactct ggaggtcagg acataggaga tattgattca ggacttgcca gagtatggtc 6 0 

ttggggtgtg ccctgatatt acaaacaggg atcttagtgg ctaggtgatg aggccatggc 12 0 

aaatgtagat ggaccaagat caatttgcct ttctagatga ggttttctag gtgaaatgtt 180 

tttgaaacta ttttgtagcc tagtataatt tataaaagta gagagaaact ataaatataa 240 

atttggaagg ggttagctaa aaggagaaaa cagcagaatc ttcatatata tagaaatgga 3 00 



<210> 268 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 268 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgtgt 6 0 

gtgttgcatt acttttcatt gggattgaac tgagaaataa caaacaagct ttaagtggga 12 0 

aattaaaaaa aagaagtaac ctatgtagat ccaaacttaa aatgtgagaa attattgaaa 18 0 

tttcattttc tacaaacttg aaattagcct gctaattgta aagttgtttt aataatgctg 240 

acaaatgtca gttacgtttg caaaggagtg tatggttcta ggtatttgcc tactgttacc 3 00 



<210> 269 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 269 



64 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 



PCT/US98/27610 



cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 6 0 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 240 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 3 00 

<210> 270 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 270 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 60 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 240 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 

<210> 271 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 271 

ccacatttaa gtgagatatg ggaaggagga gcagattgtt tttgaaggga ggaagagcag 6 0 

ttacttaggg tcaaattaag ttgtaaaatc ccccccggga ttttgtatgt aagtcaaagt 120 

gaattgtatt tggaagaaga actggggagc ccacctctgg tatttttttt atgtccctca 180 

tatggacaaa taaacctctg gtattaaatg aattttcttt tgggggattc tatatattcg 240 

ggatttcaac caccaaccta tctggttttt cccgctgaaa tgttgggtga tggaatcagg 3 00 

<210> 272 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 272 

gaacgcttcc attttatacc tgtgtctagt tagtttctgc ctatctatcc aagaagcttt 60 

tatcaagggt ccaccatgtg ccagccactg aagtagatat aaatacaagg atgtgtaagg 12 0 

tatggatgat ggtatacgaa ctgtcatctt actggatttg tccgctctgt taaagatacg 180 

gttccgaaaa ctttttaaag ccctagagag ggctttaagg caatgtagca tcatatatag 240 

aggcatcaac ctgttcatat ctttctattt aacagaactg tgcacctggg cacaagggtg 300 

<210> 273 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 273 

gaatggcgtg aacccgggag gcagatggtc ttaaagtggg gagacccggg ttacaggcct 60 

gactgcatca ctaactcgct gtgtgtccct gggcaagtca gtgcagtgca gtagcctctc 12 0 

cgtctccgac tgaggagcaa agccctcggc tcaagatcct cacctacttc acagggattt 180 
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gaaatagtgc agtcaacagg aaaagaaaag cgctatagaa atgctcgacg ctatcacttg 24 0 

gggcccacgt ggaagtatca acgtataaat tggcccaggc agacagaagg atgcagggga 300 

<210> 274 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 274 

ggaaccaggg gctgcagaac cagcccctcc ccaatgagga ccccctctgg acgcccctcc 6 0 

ccatggagaa caccaggagc cacagacccc agaccacagg agcacacagg ggagggcacg 12 0 

gggcggccgg ggcagggtgt ctgctgcctc gtttatggga tttgctccgc gtctagcaca 18 0 

ctgctgcctg cagtgctcct gtcccctgca gtggctactc tgggcctacg ggcctaatcc 24 0 

tggttggcat gaaaatgtcc tgaggctact gtgacaaatt tccacaagct gagtggctta 3 00 

<210> 275 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 275 

ctttgggaag cagaggtggc aggatcattc cagcccagga gttcaagacc agcctgggca 6 0 

acacagtgag tgagaccctg tctctattta agaaaaaata attaagaaat tttattaaaa 12 0 

aagaagaatc aggaaaccaa gtccaaccca actaaacctc aaatgaacca gcccctaaca 180 

cagatgaggg gatttgggac tgataagctc tgtgctgtgt ccatggcccg tcatttatca 24 0 

aggctgcagc tttgtaaatg tggctatttt tatgttgtgt atagtttcta tcatttattt 300 

<210> 276 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 276 

tttgtatttt tagtagagac agggtttctt catgttggtc aggctggtct caaactccta 6 0 

acctcgtgat ccgcctgcct cgacctccca aagtgctggg attacaggca tgagccacca 12 0 

tgcccagcca aagatcattt ttttatatag acttcagccc tttgtaaata ttgtaactgg 180 
ggagtataga gtagaaaaaa agtatagtta aaacatttgt tctacaaatt aacctttaaa 24 0 

aatataatta ctgctaaaaa tagagtgctg ttacacttaa ggaaaattag tgccattttg 300 

<210> 277 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 277 

ctcacacagc atgtgtcaga tccatggggt aggagtcggc cagagacttg gtaacagaca 6 0 

gattgctgga tcccacccct agactctctg attcagttag tttggggtaa ggcgcaagac 12 0 

tgaatttttc acaagtttcc cagtggtgct gatacttctg gtccaggaac ttagtgggag 18 0 

agaacgacta atctagacca tttcacttca cattctgagc ttcttgtaca ctgtcacact 24 0 

gcatcctttt aacaatgcat tccctatcct attgcaatac tgacatctca tcaatatttt 300 
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<210> 278 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 278 

ctgacaactt gattgggttc tccttcaggt ttgaagcgcc ctcgagaagt gtctaaagga 60 

gacagttgat agccaaacaa cagttttgga ttcactgact gattatgaaa gaagcagtag 120 

actggtatca agaatcagtc agcaaggagg ccctcaccag acgccagtgc catgttcttg 180 

gacttctcag cctccatatt catgaactaa gtttttggaa tccttaggct tccacgtgtg 240 

gaaagcctga gctaacctac tggaggatga gccatcacct ggagcagatt caggccatcc 300 

<210> 279 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 279 

ggtaaaccta tttatataat agaaggatga ttataaacat ttaataaatt atatcaaata 60 
gatattatat attaaatggg cagataatag aaatctgtcc aagcaaaact ctggataatt 12 0 

tttatgttgc cttatttttt gttttctgtg aactccaaga aaaatgagat accagtttgg 180 
aacagatgta atattgctga tttaacagtt tagggatact ccccaagttc aataattttg 240 
ccaagataca aatttaaatg gaacctttta tgaagcttca tagtgtgtga agaacttacc 300 

<210> 280 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 280 

ataactgctt gcgaagatgt agtttctctt tggaaagcta cacacacgag attatacaca 60 
tcaggcactg gaactatctg taatactgga acctctgcga agtgccaggt ataaagtttt 12 0 

tcccactgcc aagcatccag agctttggga aatttggaaa tcagagagat cagggcattg 180 
ttttgttcct ctgatgatga aagtgaaaag caagtactac tgaagtctgg aaatataaaa 240 
gctgtgcttg gcctgacaaa gaggaggcta gttagtagca gtgggaccct ttctgatcaa 300 

<210> 281 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 281 

caccatcgaa tatttttatt tattttgaga gacagactct gtcacccagg ctagtcttaa 60 

actgttggtg aatcttaagt gattctccca cctcagcctc ccaaagtgct gggattacag 12 0 

gcatgagcca ctacccttgg ctgtgatcaa gtatttagtc tgttgttaaa tgtttactaa 180 

atagtctgaa gtagagaaaa tagcacccaa tctaaaataa ggtgaggtct agtcacttat 24 0 

ttaaatctac attttaagct atagtttact attagtttaa actttaagac aggtaatgtt 300 



<210> 282 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 282 

gcaaccttcg cctcctgggt tcaagtgatt ctcctccctc agcatcccaa gtagctggga 6 0 

ctacaggcac gtgccaccac acccagctaa tttttgcatt tttagtagag gcagggtttc 120 

atcatgttgg ccaggctggt ctcaaactcc tgatctcaag taatctgccc actttggcct 180 

cccaaagtgc tggcattaca ggaatggagc caccgcgccc agcctgattt ctttttttag 24 0 

gtcttgtcag gaaagatatt gattcttttg attcgtgaac atggtttttg gtcgtcttta 300 



<210> 283 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 283 

cccaggtagc tgagactacc cacaccttgg tcccagctac ttgggaggct gaggtgggaa 60 

aatcactttg cccaggaatt caaggccgca gtgagctatg attgcaccac tgcactccag 120 

gcaacagagt gagaccctgt cttaaaaaaa gaagggagaa agtgtcagat ggtgatgagg 18 0 

tctggggggg aaatagagaa tggggatcag gagtgtggat ggtggtattc cctcaccaag 24 0 

aggtgacatg tgagcaggga gctgggaggt gagggtgtga cccgtgtgga aatcagggaa 300 



<210> 284 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 284 

ggtgtcctcc ccagtgcgcc gcgatttttg tgtccaagcc ccagagtccc tctgagacca 6 0 

acccccagcc agcacagact tcctgccttc ccagctcgga agcgccctcg agaagtgtct 12 0 

aaaggagaca gttgatagcc aaacaacagt tttggattca ctgactgatt atgaaagaag 180 

cagtagactg gtatcaagaa tcagtcaggt ttttggaatc cttaggcttc cacgtgtgga 240 

aagcctgagc taacctactg gaggatgagc catcacctgg agcagattca ggccatccta 300 



<210> 285 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 285 

aattccgttg ctgtcgggcc gccatgtcat tctggagaga gacagagtaa aacaaagaag 6 0 

gtgatgggta aagcgcagtc gcctgctata tattgtctat ttttggtttt tcacttacct 120 

tttatattta tgtcttttat gtacaacagg attataagta gcttgagtcc agtgaatata 180 

ccatttcatt ttgctatcct tcactgcact tagcttagag gaaataatca cagcttatta 240 

ttgattaatt aattaattaa tagatgaatg gtgaacacat gactatcatc ccaagaaatg 300 



<210> 286 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 286 
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agccaatgag gcttttgcct 
tgtgcagagg gcccctcaca 
gcgcaagtgg gatggtgacc 
aaacccatgc cacctgaagg 
tgtggctgaa gccctggagt 



gccagcagtg gacccaagcc attcagcttt acagcaaggc 
atgccatgct ttatggaaac cgagcagcag cctacatgaa 
actatgatgc cctgagggac tgcctcaagg ccatctccct 
cacactttcg cctggcccgc tgcctctttg agctcaagta 
gcctggacga cttcaaaggg aaatttccgg agcaggccca 



60 
120 
180 
240 
300 



<210> 287 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 287 

gggtgacaga gtgaaactcg tatctccaaa caaacaaaca aaaagtcctt aaacatatgt 60 

gaacaaaaat tttgtgatgg aaggattcta gttaatgagt attgcatcaa gatttacatc 12 0 

tttcttacta aggaaaagag ttaataaaaa ttgttcttta ttttacaggc agttactgag 18 0 

gctcttccca gatctcagta aacagccact cagccttgaa aatggagtgt tgttgtttct 240 

aaacatatat ttatgtcatt tattaagtac agttcactta aataacataa gtagattttc 300 

<210> 288 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 288 

accactaaca gcatctactt gactactgat actttgatca tggagttagg gcatgccact 60 
tgatagaaat ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata 12 0 

gattgcaaca tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag 180 
cctttgaatc ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa 240 
tacaagcaga ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt 3 00 

<210> 289 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 289 

tgtccttatc tgaaattcag cgatcttcat gaataagcat ttctctgatt gtggnatatg 60 
cctttaattt tatttctaga gtgacaaatt tttggttttg acagtttttt tctagcttta 120 
tagtttcttc ttggggagag aatatgtcaa cctcactcca tcatgctgaa gtaaatcttc 180 
atctcttaat tttatctctc aaaaatatcc taaggattcc ctctggagcc tgataagtaa 240 
ttgcagtatc tggtttctat ggttggatga ttcaggattc caggaataat agttactttt 300 

<210> 290 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 290 

ggaaccatga gaaccgaagc tagaattgct attgaattac tttattttct cttcccttat 6 0 

tgggtagaga tacatcatta ctggcctcag gggtttaccc aaagaaaggg tatttttgag 120 
caaataatgt gatttcctgg ctattttgtt gggggcttaa gatttttttt tttcaaatgc 180 
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atttttagtc actaaaaatt aactgtcgta ccatctagaa ctatactgtc cagtaccata 240 
gcctctagcc gtatgtagct atttgtatta agattaatgg aaattttaaa tccagttcct 300 

<210> 291 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 291 

tatgatttta tttttggcct aatataggaa tgtttaaaaa aggcttttct atgaaaatta 60 

gaaatttata cttgaaatta aaagtctaca agggggagga ccttaaagct aagctaccag 12 0 

taagacaatg aataattcag aagagaacac tattctttta ctgactgagt gcccaagatg 180 

ccaatttcca tgaagtcttg atttatatat atgtacacat gttatgcaca tacatgtttg 240 

ttttctaaca gttattcttt aagcttttga gataatttta gacttacaga agagttggaa 300 

<210> 292 
<211> 278 
<212> DNA 
<213> Homo sapiens 



<400> 292 

cccagaccta tggagtcaga cagtaggttt gaggcccagc aatctatggt ttaacaagcc 6 0 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg tttggtatgg 12 0 

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 180 

ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttn nnnnnnnnnn 240 

nntgntttnn cccccaggnt gnanngcagg gncccaat 278 



<210> 293 
<211> 297 
<212> DNA 
<213> Homo sapiens 



<400> 293 

ggaaggcagt gggaggagag gaccaagtct caaactccag aagccccacc tccctgagct 6 0 

cagctcctct gccaagcccc ctcagcgcga agtcctcgtc cagagaaggc aacggcgaga 12 0 

aacaaatcca acatcctggg ctgctttttc cttcccccac tttttaaaag tttggtgtcc 18 0 

aagtcacttg acaaacccag accctaacaa tgatattttg tgtagaattc tgggatcaaa 24 0 

atataatttc aaaaataata tattttctga catcccccaa aaaaaaaaaa aaaaaaa 297 



<210> 294 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 294 

ggaacagttt gagcaaaggc tctcaagtaa tagggtgtct gacttgttca tttttgaaag 6 0 

tagaactaat aggatttctt attggaacgt agggtgtaag agaaaagagg agtcaaaaag 12 0 

agccacaaga tttttggtct cagcaattag aaggatagaa ttgacattta ctgagatttt 18 0 

tgtttttgtt tttgagacgg agtttcgcta ttgttgccca agctggcgtg caatggcgtg 24 0 

atctcggctc agtgcaacct ccacctccca gattcaagcg attctcctgc ctcagcctcc 300 
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<210> 295 
<211> 299 
<212> DNA 
<213> Homo sapiens 

<400> 295 

gtaatattga tgtgattgtt gtcgcttgag aaaaaaaggc aacagctgat tctttcaaca 6 0 

actgtcacag aatggctggg ctgagaacgc tgcccagggc cctgcagctg gcgggagnnn 12 0 

nnnnnnnnnn nnnnngtgcn tgctgcaaca tntggttana tngtatcctt ccctanagnt 18 0 

gctacnnctt nnatcccctt gtnaatatgt tgagntnnct tngcnttcnn gntnntccng 24 0 

ntnnttgaca cntatgnaan ttntntngtc tngctctgct ngatnncttn nangctgcc 2 99 

<210> 296 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 296 

gcagaacctt ttcccctcta ctcttgtcta aaagttctgt gtggcacaca gagatgcgac 60 
ctactcaatc tgacttagta aaaccatgct gaaaaatttt ggtctaaaaa ggacccatac 120 
ccagcaccca tgaaataaaa gattcatctg taattgggat tcaaagggat taaattcctt 180 
tggtcatact cataaatagc actaaagtgt tataacattt tcatttacct atttttagtt 240 
ccttcatttt aacttaataa aaatcttgga ttgatattct tttttttttt ttttgggacg 300 

<210> 297 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 297 

gctaggatta caggtgtgag ccaccatgcc cagccactta tctttaaagg attaagttta 6 0 

tgtttcctac tatgggaaac catcccaccc caaacttgat gaccgcatta tgtgctttta 12 0 

tagaacatgg cacttctcca ggatagcatt tattctgttt tgtaagtgtg aatgtaatta 180 
ccctacacac agcatacaca taatcttcat attctttgcc ttgtcttgtg aaggcaaggg 240 
ccatgtctat cttattcgtc attagattcc cacatccaac atagtcctgg ggacagcacc 300 

<210> 298 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 298 

ccaaatctgc ctagagattg agttcacagt gtatgttctg ggggcgctgg tgcagtcagc 60 
ggtccagtct ccagcctgca ggcgtgcaca ctggggtgga cgatgggtgg ccccgcaggt 12 0 

gtacacattt gggtggcccc ggcccctata ccccagtgtt ctctttgatc cagtcccgaa 18 0 

acagagggag ccttgtgtac acgcctccaa agtggagctg ggaggtagaa ggggaggaca 24 0 

ctggtggttc tactgaccca actgggggca aaggtttgaa gacacagcct cccccgccag 3 00 

<210> 299 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 299 

ctccattgtg aagatccagg catttttccg agccaggaaa 
attagtgcat gcaccccacc ctcctctcag tgtggtacgc 
tcaaagccag caagacttct ctgctgctgt gatctgcaca 
ctggggggat gcagtgtgtg ttagtgccca tgtggcattg 
gcggcatggg caagatgacc ttccattagc ttcaagtctt 



gcccaagatg 


actacaggat 


60 


agatttgccc 


atctcttgaa 


120 


ccctccaacc 


tgggcaggga 


180 


tggcactgtt 


gccccccatg 


240 


gttctcttgt 


ctgtggtctg 


300 



<210> 300 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 300 

agcaattcca ctcctagctc cacccacagg aattgaaagc 
gtgcaccaaa gttcacggca gcatccttcg ccatagtggc 
catcatcctt catcatagcg gcagcatccg tcgtcacagc 
cggcagcatc tgtcgtcaca gcggcagcat ccttcgccaa 
tagcggcagc atcctttgcc atagcggcaa ggtggaaacc 



aaagacgcaa 


acagatgcct 


60 


agcatccgtc 


gtcacagcgg 


120 


ggcagcatcc 


ttcgccacag 


180 


agcggcagca 


tccttcgtca 


240 


ctgtccatcc 


actgaggcgt 


300 



<210> 301 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 301 

tcacagatat gaaagttcag tcagaggggc tgggccgaca 
atttttagga tcagtgagac ggtgtgtatt tggaagcatt 
ttacttccgt gggcacctgg tgttattggt tggactagtc 
aagcaatggg atgtgtgtgc atgtgtttgt gcagagacag 
tggcttatgc agttgtgggg gctagcaagt ctgaaatttg 



tctgtgcttt 


tccctgcagg 


60 


tcaaatgtgt 


taccatcgtg 


12 0 


aggattctcc 


agagcagcag 


180 


aaagagagat 


tttaaggaac 


240 


cagggcgggc 


cagcaagctg 


300 



<210> 302 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 302 

tcaccaggaa tacagtgaca ttaaaagtgt gatatggttt 
ttcaacttga actgtatcta tctcccagaa ttcccacatg 
gaggtaactg aatcatgggg gctggtcttt cccgtgctat 
acgagatctg atgggtttat caggggtttc cacttttgtt 
cagcatgtaa gaagtgcctt tggtctccta ccatgattct 



agctgtgccc 


ccacccacat 


60 


ttgtgggagg 


gacccagggg 


12 0 


tctcgtgatg 


gtgaagtctc 


180 


tcttcatttt 


ctcttgccac 


240 


gaggcctccc 


tagccatggg 


300 



<210> 303 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 303 
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gccctctcca ttttctgagg aggtgatatt tgggcagatt acaaactgag gaagcatact 
ggatagacat caggatgaag agaataggca gttgaaaagt cccagaaagg ggagtgtgct 
tagagtgttt gaggaacagc aaggaagcaa gcccttgttg aaacagattg agcaaggtag 
aaagtggtaa aagatgaagt taaagaggta gctgagagcc agatcatgta aagccttggt 
aaggactgac ttttatttta agagggttag gaagacattg gtaggttttg actctggctt 



60 
120 
180 
240 
300 



<210> 304 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 304 

aacaggaata tggaaagaaa ctcagagccg agttagtgga aaagtggaaa gcagagagag 6 0 

aggctcggct ggcaagagga gaaaaggaag aggaggagga agaggaggaa gagatcaaca 120 

tctatgcagt caccgaggag gagtcggacg aggaaggcag ccaggagaaa ggaggggacg 18 0 

acagccagca gaagttcatt gctcacgtcc ctgttccctc gcagcaagag attgaggagg 24 0 

cactggtgcg aaggaagaaa atggaactcc tccagaagta tgcaagcgag accctgcagg 300 

<210> 305 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 305 

aatagtagaa agggtcccca ttcctgctca gcaccgcacc tctctacccc cccacagaca 60 

cacatgcaga cacacacatg cagacaacac gcagacacac acatgcaggc actcacatgc 12 0 

aggcccatgc acacacacgt gcacacacat gcagagacat gcagacacgc aggcacacat 180 

gcacacatgc aaagacacgc atgcaggcac acgcagacgc acacagagac acacatgcag 24 0 

atacacatgc acacacacat acacacactg gcccctgttt ttctgtggtg tcactgggtg 3 00 

<210> 306 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 306 

cagcaaagac tttatttttg tacagaagat ggtgaagtcc aagacggtgg ctcagtgcgt 60 

ggagtactac tacacgtgga aaaagatcat gcggctgggg cggaaacacc ggacacgcct 12 0 

ggcagaaatc atcgacgatt gtgtgacaag tgaagaagaa gaagagttag aggaggagga 180 

ggaggaggac ccggaagaag ataggaaatc cacaaaagaa gaagggagtg aggtgccgaa 24 0 

gtccccggag ccaccacccg tccccgtcct ggctcccacg gaggggccgc ccctgcaggc 300 

<210> 307 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 307 

gctgcttctg gctggggggt ccttggcctt catcctgctg agggtgagga ggaggaggaa 60 

gagccctgga ggagcaggag gaggagccag tggcgacggg ggattctacg atccgaaagc 120 

tcaggtgttg ggaaatgggg accccgtctt ctggacacca gtagtccctg gtcccatgga 180 
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accagatggc aaggatgagg aggaggagga ggaggannnn nnnnnnnnna ntggccttnt 24 0 

gtggcctcca ccagcagctn tnnannatga catggagtcc caactgnacg nctccctcat 300 

<210> 308 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 308 

agttaagagt gtgaacccta gatttgccat ctgaaagtca tgtgtccttc agtgatgcat 6 0 

ttaacctctc tgtgcctcaa atttctccct ctggggtatg ttaggagtat acaaattaac 12 0 

acatgtaaag tgcttagaat agattggtac tgttaaatat gagctaacgt cacatttgat 180 

atttttttaa aaagaaaaaa tcattatgga gtctcagtcc tagagattct gattcattaa 240 

ttctgcttct cggcaaggag cgatttgctg gtgtagacat tccgggtccg tgtaaagggt 300 



<210> 309 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 309 

ccaacaccca gttctcactc tgtcatccag gctggtgtgc agtggtgcaa tgtgggctta 6 0 

ctgcagcctt gacctccagg acaagtgatc tcccacctca gcctccggaa tagctgggac 120 

tacagctcaa caacgcccct ctgaaagtag gactcttgga aatgaacctt gttgggagta 180 

aagctgaacc ttcacctctc ctttccagga ttctactcca ttcatacggc ctcacactga 240 

attaatggtt ctagcagcca catcactttg ttacccaatt gatctagtag taaagtcttc 300 



<210> 310 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 310 

aggaaacacc cccttataaa accatcatat caggctgggt gatctgacag agctagacac 60 

tgtcaaacaa acaaacaaac aaacaaaaaa accccatcac atctcatgag acttatttac 120 

tatcatgaga gcagctcagg aaacacccac tcccgtgatt cagttacatc ccactgggtc 180 

tgtcccacaa attgtgggag ctacaattca agatgaggtt tgggtgggga cacagccaaa 24 0 

ccctatcacc atgtaaaata atatctaatt tgtagagatt aaagaacaag ataacttaaa 300 



<210> 311 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 311 

ttntgcagat ctccagcaca agcctctgct agttgatctc acggtagaag aaggtcaaag 6 0 

attaaaggtt atgtttggtt cacacactgg tttccatgta attgatgttg attcaggaaa 12 0 

ctcttatgat atctacatac catctcatat tcagggcaat atcactcctc atgctattgt 180 

catcttgcct aaaacagatg gaatggaaat gcntgtttgc tatgaggatg anggggtgna 240 

tgtaaacacc tatggccgga taacnaagga tgtggtgctc caatggggag aaatgcccac 3 00 
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<210> 312 
<211> 275 
<212> DNA 

<213> Homo sapiens 
<400> 312 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 6 0 

ccacccccca cgcccgaaca gcaggggcag aggcagnnnn nnnnnntaag ngtgtnnaan 12 0 

tntnnatttn ttcctntttt ttttnnnttn aaatatnntg nnnttttttn ntantantta 180 

ttatnntntn nttattannn tntttttcnt ntnttacttt gttnttgatt ttanncnttt 24 0 

natntttttt ttgttcttct nttntattnn atctt 275 

<210> 313 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 313 

tcctgtcttc ttgcccaaat gttgcatttt ccaagaccac tctggcctgc catgccaccc 60 
attctgtgcc tataaaaacc ctgagacccc agcgggcaca cacacaagcg gctggacgtc 12 0 

aagaggaaca cactggcaga agaacacatc gaaagacgct ggcaggccat tgatggtgga 18 0 

acgattcgga cgccaaggga aattcggcca aggacagtag gagatcccgg ctgctgagca 240 
gccagactcc agaggaagac taccttccca tgctatcccc cttctggctc cccagccatc 300 

<210> 314 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 314 

ataagggtgg ggccttaatt cagtagaatt ggtggcctcc taagcagagg aagagagatt 6 0 

tttctttctc tctctgccat gtgaagacag tgaggagtcg gccgtctgca agccaagaag 12 0 

agcccttatc aggaacagac ttggctagca ccttcatcgt ggacctccag cctccagaat 180 
tgcaagaaaa tacatttccg tcgttgaaac cacccagtct gtggtatttt gttatggcag 240 
cccaggcaga ctaatacgtg aagcctgctc taaatagata aaataagaaa ttactacaga 300 

<210> 315 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 315 

gtctcagtgt ggcctgtggg gctggtgggc ggcctgcgat tcgagggccc tcaggtacag 6 0 

gacggccggg tagtgggctt ccacacagca tgggagccca gcaggccctt ccctgtggat 12 0 

atggctggat ttgccgtggc cctgcccttg ctgttagata agcccaatgc ccaatttgat 180 

tccaccgctc cccggggcca cctggagagc agtcttctga gccaccttgt ggatcccaag 240 

gacctggagc cacgggctgc caactgcact cgggtactgg tgtggcatac tcggacagag 300 



<210> 316 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 316 

gaaatgcctc tatgtaggtg aagtgttctc tctgcatgca acaggaaaaa ttaatataat 60 

attttcccca caaaagaaac acttaacaga ggcaagtgca atttataaat ttatatctaa 12 0 

aggggaatca tgattataag tccttcagcc cttggactct aaattgaggg gattaaaaag 180 

aatttaaaat aattttgaac gaatttattt tcccctcagt ttttgagggc attaaaaagg 240 

cattaaatca agacaaatca tgtgcttgag aaaaataaaa ttaatgaaaa cacagcactt 300 



<210> 317 

<211> 295 

<212> DNA 

<213> Homo sapiens 



<400> 317 

acactgtccc actccatcac ccaggctgga gtccagtggt gtgatcatag ctcgctgcat 6 0 

cctccagttc ctgggttcaa gccatccctc ctgcctcagc ctccccagta gctggaacta 120 

caggtgtgtg ccatcacacc tggctttaca tttttctgtg gggacttact atgttgccca 180 

ggccggcctc aaactcctga gctcaagtga tcctctgcct cagcctccag agtatctggg 240 

attacatatg tcggctaccg tgtctggccg ttcacatctt tggccactat ttgct 295 



<210> 318 
<211> 261 
<212> DNA 

<213> Homo sapiens 



<400> 318 

cctgaatata aagaggagga ggaagaccaa gacatacagg gagaaatcag tcatcctgat 6 0 

ggaaaggtgg aaaaggttta taagaatggg tgccgtgtta tactgtttcc caatggaact 120 

cgaaaggaag tgagtgcaga tgggaagacc atcactgtca ctttctttaa tggtgacgtg 18 0 

aagcaggtca tgccgaccca agaannnnnn nnnnnnnnrm nnntngccnn aacnnttcac 24 0 

caaatncccc gggggggctt g 261 



<210> 319 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 319 

gggacctctg cccaagaaag cctgggtatt gaccaaggtt tcccccccac tgagacagcc 60 

tgagatatgg cctcatggga agggaaagac ctgactgtcc cccagcccga cacctgtaaa 120 

gggtcggtgc tgaggaggaa tagtgaagga gggaggcctc tttgcagttg agataagagg 180 

aaggcttctg tctcctgctt gtccctggta atggaatgtc tcggtgtaaa gctgaccatt 240 

cccattcgtt ctattctgag ataggagaaa accgccctgt ggctggaggt gagatatgct 300 



<210> 320 
<211> 289 
<212> DNA 

<213> Homo sapiens 
<400> 320 
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caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca gtgcaggaca 
caagctcatg cccccgtctt gccagtgaca cttgaagcct cccgacttcc acagagtgct 
tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt ttttnnnnnn 
nnnnnntngt tntgtnnccc aggctgnann gcaggggcct gatntnggnt aantgnaacc 
ttngcctccn aggttaaagc nattttttng cctaancctc naaagtacc 



60 
120 
180 
240 
289 



<210> 321 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 321 

gaaagaccga gatagagaga gagacagaga cagagagcga gaccgtgatc gggacagaga 6 0 

aagagaacgc accagagaga gagagaggga gcgtgatcac agtcctacac caagtgtttt 12 0 

caacagcgat gaagaacgat acagatacag ggaatatgca gaaagaggtt atgagcgtca 18 0 

cagagcaagt cgagaaaaag aagaacgaca tagagaaaga cgacacaggg agaaagagga 24 0 

aaccagacat aagtcttctc gaagtaatag tagacgtcgc catgaaagtg aagaaggaga 300 

<210> 322 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 322 

cgccctttaa ctgcagttct gctctatttt cttttctctc tctggagctg agagtcagag 6 0 

ggcccttctc ctcctccttt cagcccccaa cactaagctg atggattgat aaatacctca 120 

gcccctcgcc ttcctcaacc cacctggcaa gtcttcttag gatctgatcc cagttttctg 180 

gaagcaatcc taccccagcc caagcttccc aagagtcgag ccttaatcct tctcacttct 240 

cagtgtcaga gcagaaatga atcctggggt tgactgtgtc cattcgggtt attagcagct 3 00 

<210> 323 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 323 

agattatgag catgtagaag atgaaacttt tcctcctttc ccacctccag cctctccaga 6 0 

gagacaagat ggtgaaggaa ctgagcctga tgaagagtca ggaaatggag cacctgttcc 12 0 

tgtacctcca aagagaacag ttaaaagaaa tatacccaag ctggatgctc agagattaat 18 0 

ttcagagaga ggacttccag ccttaaggca tgtatttgat aaggcaaaat tcaaaggtaa 24 0 

aggtcatgag gctgaagact tgaagatgct aatcagacac atggagcact gggcacatag 3 00 

<210> 324 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 324 

gtctgagaag tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc 6 0 

tggcttgcag ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg 12 0 

ataagaactc cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac 180 
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atcaagaaga gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga 24 0 

gctccagggg aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca 300 

<210> 325 
<211> 283 
<212> DNA 
<213> Homo sapiens 

<400> 325 

gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca agaaacacag 6 0 

aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga gcatcctcag 120 

aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta atgagcctct 180 

gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcgtcc acaaggccag 240 

ttcactctct gatcctgtgg aaaggatgcn ngtgtgtagc tgc 283 

<210> 326 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 326 

atgacatcct cattatccac actgcaaagc caaccatccc tatgatgggt tcattgtgga 60 
tcatgactta gtgggtcaag agtttggaag tggctcagct gggcggttct tctgctccat 120 
gtggctgcca gatggtaccc tgctggtggg cagtctggtc tagagggtcc atgatggctt 180 
tactcacatg cctggcatct tgacagggac agctggaagg caaggttcag ctgggactgt 240 
ccacagagct cctccctgtg gcctttccag catggtggtc tcagggtagc tggacttcct 



300 



<210> 327 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 327 

ggtagactgg ctagggatcc tggacccagg gttccacgta gcaacacctg ctgagttctc 6 0 

tgggttttct tcctgcctca tgtagcccag acttggagct gaagaagctg gaaacatgga 12 0 

aacaccaaca gctacagacc aaaaaaagtc ccaacaaagg cctgtcagtc tgccagcctg 180 

ttctgtggat ttccaactca agattgcagc atcaactcac acctgaagtt ctggcttccc 240 

tacaaacttt gaacttgcca gtccccacaa tggcataagc caattcctta aaatgaatgt 300 

<210> 328 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 328 

gtcacaggca ggtttaatgg ccagtttaaa acttatgcta tctgcggggc cattcgtagg 60 

atggtgagtg tttccctggg ctttgctcat cacttcggga catcgtggac tttaccgtgc 120 

gcattggagt gtgtgatggt gcctgagtag atctgctggc agagtagttt gagccagctg 180 

gactgggctg gccgcctgcc gcttcttgag ggtggaagag gggtgctctg agaagacact 24 0 

caggcagcag actctgcctc tcacttaagg tgcccccccg accccgctcc accatagtca 300 
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<210> 329 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 329 

ttttggtcgt ctttaatttg tctcatcagt gcctccatgt gtttttgatg cctttgaact 60 
ggtattttta aaatttcaat ttctaattgt tcattataga aacacaattg ggttttatat 120 
attggcattg tattttgcaa ctttcctaaa ctcactagta attctagtag ctttttttgg 180 
tagattctta aggattttct gtgtaaatag tcatgtcatt tgtgaataaa gccatttttt 240 
tttccttttc aaattttgtg ccttttattt cttattctta ccatatcaca ttggcaaaga 3 00 

<210> 330 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 330 

tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc tggcttgcag 6 0 

ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg ataagaactc 120 
cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac atcaagaaga 180 
gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga gctccagggg 240 
aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca gccacattta 300 

<210> 331 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 331 

accgccctgt ggctggaggt gagatatgct ggcagcaata ctgctctgtt actccttgct 60 
acactgagat gtttgggtaa agagaaacat aaatctagcc tacgtgcaca tctgggcaca 12 0 

gtacctttcc ttgaacttat tcgtgataca gattcctttg ctcacatgtt tccctgctga 180 
ccttcttccc acctgttgcc ctgctacact cccctcgcta agacagtaaa aataatgatc 24 0 

aataaatact gagggaactc agaggccagc gccggtgcgg gtccccccca tgctgagcgc 300 

<210> 332 
<211> 300 
<212> DNA 
<213 > Homo sapiens 

<400> 332 

ggaaaaacaa caggtttgag tcctataaag ccataattta actccagtag ctgatgtcag 6 0 

acaagcttgt cctatgtcct atttgagtgg cagcagcgcc agcccagcaa gaaggctggg 12 0 

ggttgtcaag gttgtcccca gaccttgctt gcagtggttg gagaacccag ggggctgcct 180 

tgggccctct ggccagaggg aagcgggcag ctctagccct ggagattgtg gtcacattgg 24 0 

ggcttgttta ggattggagg gccaggtcac ctccccagcc accctccctt ctctcctctg 3 00 



<210> 333 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 333 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 12 0 

gatcaagtag cggaggccag gattttggcc acctcccggg caagttgcag ggcagtggcg 180 

ccgggagcaa aagcagcatg atgcagctca tgcacctgga gtccttttat gaaaaaacct 24 0 

cctcctgggc ttatcaagga agatgacact aagccagaag actgcatacc agatgtacca 300 

<210> 334 
<211> 262 
<212> DNA 
<213> Homo sapiens 



<400> 334 

gccatgccca tttgtttact cattgtctat ggttgctttc atgccctcac agcaaaggcg 60 

agtagttgtg atggatcaaa tggcccacaa agcctgaaat atttactctt tgacccttta 12 0 

cagaaaaaaa ccttgttgac ccctgcttta gagaatgaga agccatgcag ggatcagtga 180 

tgccagagga agggaaggaa ctgcttccag ctattgtgac aataataata ataataatat 24 0 

tgggtctttg actagaacgt gt 26 2 



<210> 335 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 335 

tcntntctcn ntattnttgn gtagtncctc ntttccttgt ncnntnntcn nctnttgnct 6 0 

tttgcggacc ctcgattcta tctcatatga gtgagaacgc ttaccagtgc agcgaatgtg 12 0 

ggaaagcctt ccgagggcac tcggactttt ctaggcatca gagtcaccac agcagtgaga 18 0 

ggccttatat gtgtaatgaa tgtggaaaag ccttcagcca gaactcgagc cttaaaaagc 240 
accaaaagtc tcacatgagt gagaagccct atgaatgcaa tgaatgtggg aaggctttta 



300 



<210> 336 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 336 

gaggacccac tcccccagga ctcctttgaa ggcgtggacg aggacgagtg ggactagcct 60 

gcgcccccgt cacctccacc tcacctgtgc tgccacttcc tagtgcacac ctcacggctc 120 

atcctcaagc tggaagatac ctctctggcc ccggcacatg tcacccctgc actcctgcct 180 

tcccgtgggc acttccacat cctctgggcc tctggcagtt cccagggact gttttcacct 240 

ctgctgtctc tggggtcagc tgctgctcat cagctgcccg ctagcatgtg gccaggggtg 300 



<210> 337 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 337 
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agacaaccca gaaacaaatt catacatcta tggtgaccac ttttgacaaa ggaatgaaga 
acatacactg gggaaaagat aatgtcttta ataaatggtg ctgggaaaac tggatatcca 
tatgcagaag aatgaaacta gacccccatc tcttagcata tacaaaaatc aaaattaatt 
aaaaagttaa atctaagacc tcaaactatg aaacagctaa aagaaaacat cggggaatct 
ctccaggaca ttggagtggg caaagatttc ttgtgtaata cctgacaaac aggcaaccaa 



60 
120 
180 
240 
300 



<210> 338 
<211> 292 
<212> DNA 

<213> Homo sapiens 
<400> 338 

tcaataacca tgaagatgca tcctaccacc gtcagggcaa tcattagata gctgatcttc 6 0 

actcgcatct tcatggttat tgagggcaag aaggctgccc aaagacacga gactttaaca 120 
agcttgaact tagaaaagaa agctcgtctg aaagaggaag cagctatgaa ggccaaaaca 18 0 

gagtagcaga ggtatccgtg ttggctggat tttgaaaatc caggaattat gttataacgt 240 
gcttgtatta aaaaggatgt ggtacgagga tccatttcat aaagtatgat tt 2 92 

<210> 339 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 339 

gaaatttgca ctgatggctc agaaggctta cgtcatggag agtatgacct acctcagagc 60 
agggggggct ggaccaacct ggctttcccg actgctccat cgaggcagcc atggtgaagg 120 
tgttcagctc cgaggccgcc tggcagtgtg tgagtgaggc gctgcagatc ctcgggggct 180 
tgggctacac aagggactat ccgtacgagc gcatactgcg tgacacccgc atcctcctca 240 
tcttcgaggg aaccaatgag attctccgga tgtacatcgc cctgacgggt ctgcagcatg 300 

<210> 340 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 340 

ctcagngcan cgatcatggc tcagtgcagc ctcaaactct tgggctcaan canagcgggn 6 0 

acctcaacct cctgagtagc taggactata ggcacacagc accatgcccc ggctattttt 12 0 

ttattttgta gagatggggt ctcactatgt tgcccaggct agtcttgaac tcctggcctc 180 

aagcaatcct cccacctcgg cctcccaaag tgctgggatt aaaggcgtga gccaccgtac 24 0 

ctggcccttg gtggaatctt tagggttttc tattcataca tataaaatca tatcattggc 300 

<210> 341 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<400> 341 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg ttgggtatgg 6 0 

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 12 0 

ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttt ttnnnnnnnn 180 
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nnnnnnnnnt tnncccccng ncngnanngc aggggcccaa nntnggntnn ntgnanccnc 24 0 

cnccnccggg ntnnncccct ttntcnngcc taacccnccc nagnacnngg aactac 296 

<210> 342 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 342 

ggcacgatca tggctcattg cagcctctaa ctccggggct caagcaatcc tcccacctca 6 0 

gcctaccaag tagctgtgac cacagctgcc cctcaccatg ctaagctaat ttttttaatt 12 0 

agatagtaca taaacgtccc aaaattagaa gataaaaaga catgagggat ccattctaat 180 

ttgtgtttgg agtgtaatgg tccagctcca ttcttctgca catggatatc cagttttaca 24 0 

caacactgtg aatgtaatga atgccactga atcatacact caaaaatagc taaaatggca 300 

<210> 343 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 343 

gttttcatca ctacatattc tacacacact gggaagctct gacaacttat tccctgctat 60 

tatcaactaa agatcaccct ttctactgct gtctctggag caggagctgg caaactatgg 12 0 

cctgctgtct gtttttgtac agttttactg aaacacagcc atgcccattt gtttactcat 18 0 

tgtctatggt tgctttcatg ccctcacagc aaaggcgagt agttgtgatg gatcaaatgg 24 0 
cccacaaagc ctgaaatatt tactctttga ccctttacag aaaaaaacct tgttgacccc 



300 



<210> 344 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 344 

ccccaacctg cactctaccc acccccatca cctactccag ctcccaactt ttgtggactg 6 0 

agcggccgca gagactgggt cgccttggat tccctctgcc tccgaggacc ccaaaagaca 12 0 

cccccaaccc caggccagcc ggccctgctc tggcgcgtcc aaaatactac ctagcacagg 18 0 

cctctgctcg aggcaccccc aaactaccta tgtatccagc cccagagggc ctccattccc 24 0 

aggaagtccc tatgtatccc aacactggca gacacccagc accaccctcc cagacccgca 300 

<210> 345 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 345 

cccccatcac ctactccagc tcccaacttt tgtggactga gcggccgcag agactgggtc 6 0 

gccttggatt ccctctgcct ccgaggaccc caaaagacac ccccaacccc aggccagccg 12 0 

gccctgctct ggcgcgtcca aaatactacc tagcacaggc ctctgctcga ggcaccccca 180 

aactacctat gtatccagcc ccagagggcc tccattccca ggaagtccct atgtatccca 24 0 

acactggcag acacccagca ccaccctccc agacccgcaa gaaagtgaat ctcactacta 300 
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<210> 346 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 346 

gtccacggtg ctgaacatca tcatctttga agactgtagg aaccagtggt ctatgtcccg 6 0 

accactactt ggcttgatat tgcttaatga aaagtatttt tctgacctaa gaaacagtat 120 
tgtgaacagc cagccaccgg agaagcagca ggccatgcac ctgtgttttg agaacctgat 18 0 

ggaaggcatc gagcgaaatc ttcttacgaa aaacagagac aggttcaccc agaacctgtc 24 0 

agcattccgt cgagaagtca acgactcaat gaagaattcc acttatggcg tgaatagcaa 300 

<210> 347 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 347 

gctctgagcc caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag 6 0 

tgtgagttct acttcagcct ggacgccgac gctgtcctca ccaacctgca gaccctgcgt 12 0 

atcctcattg aggagaacag gaaggtgatc agaccccatg ctgtcccgcc acggcaagct 18 0 

gtggtccaac ttctggggcg ccctgagccc cgatgagtac tacgcccgct ccgaggacta 240 
cgtggagctg gtgcagcgga agcgagtggg tgtgtggaat gtaccataca tctcccaggc 3 00 

<210> 348 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 348 

gttctgtggc tggcatggtc tgcctgctac tggagagatc tcctgagaat tcaggtttgg 60 
attggtgctg tcatcttcct gggaatgctt gagaaagctg tcttctatgc ggaatttcag 120 
aatatccgat acaaaggaga atctgtccag ggtgctttga tccttgcaga gctgctttca 180 
gcagtgaaac gctcactggc tcgaaccctg gtcatcatag tcagtctggg atatggcatc 24 0 

gtcaagccac gccttggagt cactcttcat aa^gttgtag tagcaggagc cctctatctt 300 

<210> 349 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 349 

gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag ggnattggtg 60 
gcacttcaaa gatcatgaag agcaagataa agtcagacct aaagccaaaa ggaaagaaga 120 
accaagctct atttttcaga gacaacgtgt ggatgcttta cttttagacc tcagacaaaa 180 
atttccaccc aaatttgtgc agctaaagcc tggagaaaag cctgttccag tggatcaaac 240 
aaagaaagag gcagaaccta taccagaaac tgtaaaacct gaggagaagg agaccacaaa 3 00 



<210> 350 
<211> 270 
<212> DNA 
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<213 > Homo sapiens 



<400> 350 

ccatgctgnt aacgggtttc aaggggactc ttgaggaant gccccctaaa atagaacaca 6 0 

gcaatanggn gggcttcctg tccccaggnc caccccacag tgctntntgg cactggnaac 12 0 

tctgctangg agngantgna nnnnaccant aannnnnnan nnatcnacan nnnnnnnncn 18 0 

nnnnnncntn tnnccnanrm ntannctncc ntannnnanc cnnccannan cactcncnat 24 0 

naacgnnnnn ttantgagan nttctcaact 270 

<210> 351 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 351 

aaatgactcc ctgcaaaacc caacccatgc tgctggctgt gggatttttg gtgtaagcct 60 

atctatgcac tctatcagcc agaatttggc atttagctct tagttaaatc tagtaaagga 12 0 

cagtctattg tttaaagaga aggtgcattt gttcctcaat caagcaagag cacctgtgtt 18 0 

gtactgcttt atatctcatg tatatttata gtaatgaaaa gactttttaa attgtacacg 240 

tttcagtgcc tttcttgtgt tatgaaaggc aggtagatat tatagccata ggtaaaaatc 3 00 



<210> 352 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 352 

aagaaatgcc tctatgtagg tgaagtgttc tctctgcatg caacagtaaa aattaatata 6 0 

atattttccc cacaaaagaa acacttaaca gaggcaagtg caatttataa atttatatct 12 0 

aaaggggaat catgattata agtccttcag cccttggact ctaaattgag gggattaaaa 18 0 

agaatttaaa ataattttga acgaatttat tttcccctca gtttttgagg gcattaaaaa 240 

ggcattaaat caagacaaat catgtgcttg agaaaaataa aattaatgaa aacacagcac 300 

<210> 353 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 353 

cccacactcg gacactgtgg aattctacca gcgcctgtcg accgagacac tcttcttcat 6 0 

cttctactat ctggagggca ctaaggcaca gtatctggca gccaaggccc taaagaagca 12 0 

gtcatggcga ttccacacca agtacatgat gtggttccag aggcacgagg agcccaagac 18 0 

catcactgac gagtttgagc agggcaccta catctacttt gactacgaga agtggggcca 24 0 

gcggaagaag gaaggcttca cctttgagta ccgctacctg gaggaccggg acctccagtg 300 

<210> 354 
<211> 299 
<212> DNA 
<213> Homo sapiens 



<400> 354 
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gaaggaggac ctaggcacac acatatggtg gccacaccca ggagggtagt ggggagttag 6 0 

atttcagagt ccaggcccta ggttgggacc cactccaaat aatctcctcg gtgtgggtgg 120 

tggttctata gagggataaa tgaataataa acattgttaa aatatacgaa aaaaaaaaaa 180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 24 0 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaacnncn ncnananaaa aaaaaaaaa 2 99 

<210> 355 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 355 

actgttcatc ctaagttcca ctataaacag gctcatgact cgggcacaga cacttcttgc 60 

gtgacttttt cctatgatgg taatgtcctt gcctctcgtg gaggtgacga ttcattaaaa 120 

ttatgggaca tccgacaatt taataaacca cttttttcag cctcgggtct tcccaccatg 180 

ttcccaatga ctgactgctg tttcagtcca gatgataagc tcatagtcac tggtacatct 240 

attcaaagag gatgtggcag cggcaaactt gttttctttg agcgtaggac tttccaaagg 300 

<210> 356 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 356 

ttcagaaaga aacatttaat agggacttac aaacaaatta atgtctgagt ctcaggtggc 60 

agcaagacaa gatggtggat ccccatgcca ttacctgcta gactcagggt ttatatactg 120 

tagtggagag gtgattccga aggaatgttg taagacaatt gaagagcagt aacatcaaag 180 

ttatttgacc taagggcagg agttacagta agtatccact tttatacaag aaacaataga 24 0 

taaactggaa atcttggagc ccttcctgga actggggtta atgagaagtc aacatggtgg 300 

<210> 357 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 357 

acaaaaccta cagatggaga taaaaattac tactgttatt caacatgtgt tccagaacct 6 0 

tattttgggg agtaaagtca attgggcaga ggatcctgcc cttaaggaaa ttgttctgca 12 0 

gcttgagaag aatgttgaca tgatgtaata agaattcatt tctgacatat tttacatttc 180 

tggcaatctc aactcttatt tggaatactt ctgtgcattt gtctgtccac cgtaatttta 240 

gaaaagcata tccataacgt ttacagttgt agtacagttg tggttagtta tttgtagtgg 300 

<210> 358 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 358 

ggtgattaca gaagcccaga aggttgatac cagagccaag aacgctgggg ttacaatcca 6 0 

agacacactc aacacattag acggcctcct gcatctgatg gaccctgcac ttgatggacc 120 

agctggcacc acccagatca ataaactggc ttatttgaat ttgcggcccc ccacccagga 180 
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actgactcag tgcaagaaga cagcttcgac tccctgtgat ttcatctctg accaatccgc 24 0 

actcctggct cactggcttc cccaacccat gaagttttcc ttaaaaactc tgctcccgaa 300 

<210> 359 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 359 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 60 

aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 12 0 

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 18 0 

ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 

tgagcagggc caacaggagc tgaagaactt gcgtgcttgn gtcctggagc tggaagagcg 300 

<210> 360 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 360 

tctgtctggt gatttttatt ttaagtgaac ctttggatct atctttaact ctctttattg 60 
tgagtggtaa attccaattc tgcagcagat cagtaaactc acagtatttt tcctgtggaa 120 
atctattcaa taaggaaacc aagacaggat aataaaattt aaaaaaaaac aactttgaat 180 
tcccctgcct aggtcttcca gttgttttcc agcgcatacc tcaggtatga ctttgctagc 240 
cggggacaaa attagcacct tccgattctc tagtccaaat gaactttgtg ctaaataaaa 3 00 

<210> 361 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 361 

gtagaacaga aaatgagcat ccgatttctt cactaaagga gaccaaactg ttccttgcgg 6 0 

tctagtattg aagaactgga acttgaaagt cctccttcta ccaactccac ctccaccccc 120 

tcattcccct tctcccaaag tactactgct gttgcatgac aaccccaaat atgttctgtc 180 

aacacaaacc tgcctttggt gtataaacag ggcattacag aatggtacac cctatatatt 24 0 

tctgttcagt atccattcac tagttcttca tttataaata tcatcttccc cattctgctg 300 



<210> 362 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 362 

actaccccgg ctacggttcc cccatgcctg gcagcttggc catgggcccg gtcacgaaca 6 0 

aaacgggcct ggacgcctcg cccctggccg cagatacctc ctactaccag ggggtgtact 12 0 

cccggcccat tatgaactcc tcttaagaag acgacggctt caggcccggc taactctggc 180 

accccggatc gaggacaagt gagagagcaa gtgggggtcg agactttggg gagacggtgt 24 0 

tgcagagacg caagggagaa gaaatccata acacccccac cccaacaccc ccaagacagc 3 00 
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<210> 363 
<211> 271 
<212> DNA 

<213> Homo sapiens 
<400> 363 

ggcaattagc ctcgcttaag ttgccttttt tacacaccaa aactttttac atgaagggct 60 
ggtttcacat gaatactata ctgaaatctg tgctctcaag atctagcagt gaccagggct 12 0 

gcccggcggg ggctctcctg gcaagtcagg aaggtnnnnn nnnnnnnnnn nnnnnnnnnn 18 0 

canattantn nctgatcntc tntnangaan nnngantngc tctnttggnc nttgtnnnnn 24 0 

gncntnnnnt naantntttn ntnatgtngc t 2 71 

<210> 364 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 364 

agaggaccct gcagttaggg ggtgttactt tgtcgcccag gatggcctgg acccccaggt 60 
tcagggattc tcccgccgct gcttcctgag tagctgggac ctcaggcttc cgcctcgtgc 120 
ccgcatccct gctgtgttta ggcagcaggt ggtgacctca ctcctccctg gcctgagctc 18 0 

tccgtcccgc atcccaggcg gaggccctag ggaacacttt gaagctgagc acggggtgga 24 0 

ccctccctcc tgagtgaatg gagaatagaa agggagagga tttctgttct gttctgtggg 300 

<210> 365 

<211> 300 

<212> DNA 

<213 > Homo sapiens 

<400> 365 

gttcttcaaa gccaaccaag acaggcttag cagttttaga gcttcagaac aaattgccaa 6 0 

aagccagagt tgtttatgct agtgcaactg gtgcttctga accacgcaac atggcctata 120 

tgaaccgtct tggcatatgg ggtgagggta ctccatttag agaattcagt gattttattc 180 

aagcagtaga acggagagga gttggtgcca tggaaatagt tgctatggat atgaagctta 240 

gaggaatgta cattgctcga caactgagct ttactggagt gaccttcaaa attgaggaag 3 00 

<210> 366 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 366 

gccagtcctc accttcccta gtcctcgtgt gtattttagg agatgcgtgg gtgtggaaca 60 

gcctcctgcc tccggtccag gtgtactggg gtctgtgtgt tgtgtttctg cgtgttctcg 12 0 

gcagaaagtg gcatgctgtc ccgcctgggt gatttgctct tttacactat tgctgaagga 180 

caggaacgaa tccctatcca caagttcacc actgcactaa aggccactgg actgcagaca 240 

tcagatcctc ggctccgaga ctgcatgagc gagatgcacc gcgtggtcca agagtccagt 300 



<210> 367 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 367 

cattgccaga gagcggtttc agcaagctgc agatctgatt gatgctgagc aacgaatgaa 60 

gaagtccatg tggggtcagt tctggtctgc tcaccagagg ttcttcaaat acttatgcat 12 0 

agcatccaaa gttaaaaggg ttgtgcaact agctcgagag gaaatcaaga atggaaaatg 18 0 

tgttgtaatt ggtctgcagt ctacaggaga agctagaaca ttagaagctt tggaagaggg 24 0 

cgggggagaa ttgaatgatt ttgtttcaac tgccaaaggt gtgttgcagt cactcattga 300 

<210> 368 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 368 

gcccggcccg gcgacgctgg cgacgctttc gcccctgagg tagtttggcg accgcgaaga 60 
aggaaaaagg gcgggcgggc ggctgtcctc tcaccgtcct caccccgcga ggcccggccc 120 
gctcctccgt cgtggatttc gcggcgatcc ccccggcagc tctttgcaaa gctgcttgaa 18 0 

acttctccca aactcggcat ggatacgact gcggcggcgg cgctgcctgc ttttgtggcg 24 0 

ctcttgctcc tctctccttg gcctctcctg ggatcggccc aaggccagtt ctccgcaggt 



300 



<210> 369 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 369 

gtggggtgtg cctcgtgtgc gtggattcgt gtgtgtgtgt gtgtcttgta tatgtgtgcg 6 0 

cagagtgcat cattttcaga ctctactatt tccgtcaagt attctgtttg atttggatca 120 
tctcaggatc ggattctgtt ttagagtgtt tctgggccag gatccgggcc cctgccctcc 180 
tctgcacctg accacactcc ctactcaggg ctagtctgtt cttcccggac atcttctggt 240 
agccgtgcag gagagggctg ggtggggcag aggccacaga ggggacctgg tgtgtcacct 3 00 

<210> 370 
<211> 273 
<212> DNA 
<213> Homo sapiens 

<400> 370 

cagaggctgg ttcagaaaag gaggaagagg cccggctggc agccctggaa gagcagagga 6 0 

tggaggggaa gaagcccagg gtgatggcag gcaccttgaa gctggaggat aagcagcggc 12 0 

tggcccagga tgaggagagt gaggcctagc gcctggccat tatgatgatg aagaagctnn 18 0 

nnnnnnnnnn nnnnnnnnnc atcatgtccn ntgcatggct acctatccca tatttnatnt 24 0 

ccctnncgtt gnttcnaatt ncacattntc ttt 273 

<210> 371 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 371 
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gatgaggagt gtttaatcat tgatacagaa tgtaaaaata atagtgatgg aaagacagct 
gttgtgggtt ctaacttaag ttccagacca gctagtccaa attcttcctc aggacaggct 
tctgtaggaa accagactaa tactgcttgt agtcctgaag agtcatgtgt tttaaaaaaa 
cctatcaaac gagtatataa aaaatttgat ccagttggag agattttaaa aatgcaggat 
gagctcttaa agccaatttc cagaaaagta ccagaattgc ccttaatgaa tttagaaaat 



60 
120 
180 
240 
300 



<210> 372 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 372 

gggccccaat gcagctgccc tctccagata cctggcagcc tcatatatca gccaaagcct 60 

ggctcggcgg caggggcctg ggggaggggc ccccgcagcc tcccggggct cctggtcctc 120 

tgctcccacg tcacgggcat cttcgccgcc cccccagccc cagccaccac ctcccgcagc 180 

caggcggctc agctatgcca cgacggttaa catccacgtg ggcgggggtg ggcggctgcg 24 0 

gccagccaag gcccaggtcc ggttgaacca ccctgctctc ttggcctcca cacaggaatc 300 

<210> 373 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 373 

accctttctg ccttctgttt gggacccagc tggtgttctt tggtttgctt tcttcaggct 60 

ctagggctgt gctatccaat acagtaacca catgcggctg tttaaagtta agccaattaa 12 0 

aatcacataa gattaaaaat tccttcctca gttgcactaa ccacgtttct agaggcgtca 180 

ctgtatgtag ttcatggcta ctgtactgac agcgagagca tgtccatctg ttggacagca 240 

ctattctaga gaactaaact ggcttaacga gtcacagcct cagctgtgct gggacgaccc 3 00 

<210> 374 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 374 

tcaaggccta cgaacaggtg atgcactacc ccggctacgg ttcccccatg cctggcagct 6 0 

tggccatggg cccggtcacg aacaaaacgg gcctggacgc ctcgcccctg gccgcagata 120 

cctcctacta ccagggggtg tactcccggc ccattatgaa ctcctcttaa gaagacgacg 180 

gcttcaggcc cggctaactc tggcaccccg gatcgaggac aagtgagaga gcaagtgggg 240 

gtcgagactt tggggagacg gtgttgcaga gacgcaaggg agaagaaatc cataacaccc 300 

<210> 375 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 375 

cttcagtgca cacaacagga gagaggagaa agaagaaacg ctagtaattc caagcactgg 60 

aattaagttg ccttcatcag tgtttgcttc agagtttgag gaagatgttg tgattgttaa 120 

ataaagcagc tccagtttca ggacctcgac tggattttga tcctgacatt gttgcagctc 180 
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ttgatgatga ttttgacttt gatgatccag ataatctgct tgaggatgac tttattcttc 240 

aggccaataa ggcaacagga gaggaagagg gaatggatat acagaaatct gagaatgaag 30 0 

<210> 376 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 376 

gggagactgg ggtctatttc acccctgcag tctcgaccat aagagatggc tacacccagg 60 

ggggccagtt cagagaccca ctcccaggtg tgcattctct ttctcaagga tgttccttgc 120 

tgagaaaaag aattcagtga tatttctccc atttgcttgt gaaagaagag aaatgtggct 180 

ttgttccacc tggctcaccg gcggtcagaa tttaaggtta tctctcttgt ttcctaaaca 240 

ttgctgttat cctgttcttt tttcaaggtg cccagatttc atattgctca aacacacatg 300 

<210> 377 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 377 

gatcagccca cctcggcctc acaaagtgct gggattacag gcgtgagcca ccttgcccag 6 0 

cccacatcat acagtttgaa atgaaacttt gccacaacca gcctttgctg tagcacacac 120 

atatatcact gaacctgttt gaaataaagt tttttttctt tttcctctgg tattctgggt 180 

tctgaagtct ggtattctgg tattctgggt tcaaaagtat gacttgagag tgttgctctg 240 

gtattctgag agttgctctg tattctgggt tctgaagatt atttgaaaaa taactcctac 300 

<210> 378 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 378 

tcgctgtgat ccaaggataa aaaagttcaa ggaagaagaa aaagccaaga aagaagcaga 6 0 

aaagaaagca aaagcagaag ctaaacggaa ggagcaagaa gctaaagaaa aacaaagaca 12 0 

agctgaatta gaagctgctc ggttagctaa ggagaaagaa gaggaggaag tcagacagca 180 

agcattgctg gcaaagaagg aaaaagatat ccagaaaaaa gccattaaga aggaaaggca 240 

aaaacttcga aactcatgca agacctggaa tcatttttct gataatgagg cagagcgggt 300 

<210> 379 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 379 

acactataga atacaagcta cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60 

cgaggcagct tcgagccaat ggtgagctcc ttctggatca gctccttcag ctccttcttg 120 

ctcaggatgc tgaaattgca aggctgatgg aagacttgga ccggaacaag gaccaggagg 18 0 

tgaacttcca ggagtatgtc accttcctgg gggccttggc tttgatctac aatgaagccc 24 0 

tcaagggctg aaaataaata gggaagatgg agacaccctc tgggggtcct ctctgagtca 300 
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<210> 380 
<211> 296 
<212> DNA 
<213> Homo sapiens 

<400> 380 

acctggacag ggccagctgc tgggggagcg gcactgggga ctggaggctg gaagcgggtg 6 0 

gtgtgtgtcc cctgtttact tttagctgag ctggggttgg gtgtacgggt tctgttcctc 12 0 

tgagccctgc ggcccacctg atgtttacgt gtgtgtgtga gggggggcnn nnnnnnnnnn 180 

nnnnnnnnnn ngtnatangc ttaacanatg nanagncnac tnactnctga ttntttatnc 240 

atttgtgcat tnnaactatg cttttncgat cttnctgntg nnatnacngg catgat 296 

<210> 381 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 381 

cagaaaagag tatagtaggg atgaccaagg tcaaagtggg taaagaagac tcatcatcca 6 0 

ctgagtttgt agaaaaacgg agagcagctc ttgaaaggta tcttcaaaga acagtaaaac 12 0 

atccaacttt actacaggat cctgatttaa ggcagttctt ggaaagttca gagctgccta 180 

gagcagttaa tacacaggct ctgagtggag caggaatatt gaggatggtg aacaaggctg 240 

ccgacgctgt caacaaaatg acaatcaaga tgaatgaatc ggatgcatgg tttgaagaaa 3 00 

<210> 382 

<211> 300 

<212> DNA 

< 2 1 3 > Homo s ap i ens 

<400> 382 

gccaccggtc tcttcctaat ctgcacagac tattttgggt atttctgggc gggcagttcc 60 

tttgcatgtt tcgggagagg tttgctgatt tggggcttat atgtcaggcc tttggtttgc 12 0 

gtcttatttt aggggttgtt tgggggcctg ggtggtcggc ctcacatggg aaggggatgg 18 0 

gtagtggatg gggtttctgt tgtatcttgt gggcgggtga ttttgctttt gtttttgttt 24 0 

cacattcttc cccctccaca agccaaagtc gtttcatttg gtttccactg tgtggactgt 3 00 



<210> 383 
<211> 273 
<212> DNA 
<213> Homo sapiens 

<400> 383 

gagatttgat attcgagtgc tgggcttagg tctgttgata aatctagtgg agtatagtgc 60 

tcggaatcgg cactgtcttg tcaacatgga aacatcgtgc tcttttgatt cttccatctg 12 0 

tagtggagaa ggggatgata gtttaaggat aggtggnnnn nnnnnnnngc cngcnttnac 180 

ttnatngcnn ctttttcttg atcnacgncn gnnatncnna nnngtntata ntaatncnga 240 

ananttnttt gnnntgcttt atcaantntt cnt 273 



<210> 384 
<211> 259 
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<212> DNA 

<213> Homo sapiens 
<400> 384 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 12 0 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 18 0 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

cgaagagaaa gctggagtc 2 59 

<210> 385 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<400> 385 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 12 0 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcat agcgccacgg gcagtgtggc ttgctgcccc ccaggaggcc tgaggctggg 24 0 

tctcactgct ctgaaaaaga cccnnccaaa atgggccttg gggctnnagg cccttg 2 96 

<210> 386 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 386 

gaagaggagg ctgtgtatga ggaacctcca gagcaggaga ccttctacga gcagccccca 6 0 

ctggtgcagc agcaaggtgc tggctctgag cacattgacc accacattca gggccagggg 120 
ctcagtgggc aagggctctg tgcccgtgcc ctgtacgact accaggcagc cgacgacaca 180 
gagatctcct ttgaccccga gaacctcatc acgggcatcg aggtgatcga cgaaggctgg 240 
tggcgtggct atgggccgga tggccatttt ggcatgttcc ctgccaacta cgtggagctc 300 

<210> 387 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 387 

ccgcagaggg cctggaagag gtgctcacca cgccagagac tgtgctcaca ggccacacgg 6 0 

agaagatctg ctccctgcgc ttccacccac tggcagccaa tgtgctggcc tcgtcctcct 120 
atgacctcac tgttcgcatc tgggaccttc aggctggagc tgatcggctg aagctgcagg 180 
gccaccaaga ccagatcttc agcctggcct ggagtcctga tgggcagcag ctggccactg 240 
tctgcaagga tgggcgtgtg cgggtctaca ggccccggag tggccctgag cccctgcagg 3 00 

<210> 388 

<211> 300 

<212> DNA 

<213> Homo saoiens 
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<400> 388 



tggaggtctc ctttcgcccc agcccaggtg gccaagccca 
gagcacattt tgtagggtgg caccttttta tccaagttac 
aagagaaaaa agtgaccttt catttttttt tcttgaaact 
tactgatttt ttttttctta aaactaaatg catgactgca 
tttcatactg tggggcaaag tatttgtgct gctttttgga 



tcctggcctc 
tagctacaca 
tgaggaaaca 
gagcggtaga 
gatggactgg 



agaacatgct 
tcagtgttta 
agatacatac 
ggtgtatatt 
aacgtctggt 



180 
240 
300 



120 



60 



<210> 389 
<211> 293 
<212> DNA 

<213> Homo sapiens 
<400> 389 

gtcaagctgg ccctggatgt ggagatcgcc acctaccgca agctgctgga gggcgaggag 6 0 

tgcaggctga atggcgaagg cgttggacaa gtcaacatct ctgtagtgca gtccaccgtc 12 0 

tccagtggct atggcggtgc cagcggtgtc ggcannnnnn nnnnnnnnnn nnnatgaanc 180 
agntactcct atggnnttag cnttntanct atnacctgcn cnaactannc tnangtgcta 24 0 

gnncttgccc caacccctac ttttgtattt atattgtgtg tgcgtgtgtg cgt 293 

<210> 390 
<211> 300 
<212> DNA 

< 2 1 3 > Homo s ap i en s 
<400> 390 

ctcacacctg ctttggatgc ttcaagcacc tcagccctct gaactacaaa acagangagc 60 
ctgcaagtga caaaggaagt gaggcagagg cccacatgcc cccaccgttc acaccctacg 12 0 

tgcctcggat tctgaacggc ttggcctcgg agaggacagc actgtctccg cagcagcagc 180 
agcagcagac ctatggtgcc atccacaaca tcagcgggac tatccctgga cagtgcttgg 24 0 

cgcagagcgc cacgggcagt gtggctgctg ccccccagga ggcctgaggc tgggtctcac 3 00 

<210> 391 
<211> 257 
<212> DNA 

<213> Homo sapiens 
<400> 391 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 
ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 12 0 

ctctcctctc tccttctctc tctgcccgga gctggtttcc gtctctcggc tcggggctgg 18 0 

aactccggcc caacctaggc gcgcagccgc cacgagatgg cgcacttccg atcaatgtca 24 0 

aagccgccgg ggagccc 257 

<210> 392 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 392 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 6 0 

cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 12 0 
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accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 3 00 

<210> 393 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 393 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 6 0 

cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 12 0 

accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 3 00 

<210> 394 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 394 

gtccatacat ggagctccct ggagcccgtg tgctctcgtg tgactgaacg ttttgtgatg 60 

aaaggaggag aggctgtctg cctttatgag gagccagtgt ctgaattgct gaggagatgt 120 

gggaattgca cacgggaaag ctgtgtggtt tccttttacc tttcagctga ccatgaactc 180 

ctgagcccga ccaactacca cttcctgtcc tcaccgaagg aggccgtggg gctctgcaag 240 

gcgcagatca ctgccatcat ctctcacnag gngaccatat tggtttttga cctggagacc 3 00 

<210> 395 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 395 

gcaaaatcaa tgtggactga acataaatca cctgatggaa ggacttacta ctacaacact 60 

gaaaccaaac agtctacctg ggagaaacca gatgatctta aaacacctgc tgagcaactc 12 0 

ttatctaaat gcccctggaa ggaatacaaa tcagattctg gaaagcctta ctattataat 180 

tctcaaacaa aagaatctcg ctgggccaaa cctaaagaac ttgaggatct tgaagcaatg 240 

atcaaagctg aagaaagcag taagcaagaa gagtgcacca caacatcaac agccccagtc 3 00 



<210> 396 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 396 

aagagcacaa gaggaagaga gagaccctca ctgctgggga gtccctgcca cactcagtcc 60 
cccaccacac tgaatctccc ctcctcacag ttgccatgta gaccccttga agaggggagg 120 
ggcctaggga gccgcacctt gtcatgtacc atcaataaag taccctgtgc tcaaccaaaa 180 
aaaaaaaaaa aaaaacnnnn nnnnnnnnnn nntntngggn gnctnntnnc nnaaanccan 240 
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ncttnataaa anccttngnt natttggaac aacccncann taaanngcag ggaaaaaaag 300 

<210> 397 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 397 

gataaatacc tcagcccctc gccttcctca acccacctgg caagtcttct taggatctga 60 

tcccagtttt ctggaagcaa tcctacccca gcccaagctt cccagagtcg agccttaatc 120 

cttctcactt ctcagtgtca gagcagaaat gaatcctggg gttgactgtg tccattcggg 18 0 

ttattagcag ctaagaagcc cagacgagta gtgtgagctg ccttgggagc ctcagtgagg 24 0 

gcactgggac tggcctcact ctcttgcccc cagcctagtg ggctttctcc tctgtctctc 300 

<210> 398 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 398 

ctgaacccta aaggaaagcc agcaaaccag ctgcttgctc tcaggacttt ttgcaattgt 6 0 

tttgttggcc aggcaggaca aaaactcatg atgtcccaga gggaatcact gatgtcccat 12 0 

gcaatagaac tgaaatcagg gagcaataag aacattcaca ttgctctggc tacattggcc 18 0 

ctgaactatt ctgtttgttt tcataaagac cataacattg aagggaaagc ccaatgtttg 24 0 

tcactaatta gcacaatctt ggaagtagta caagacctag aagccacttt tagacttctt 300 

<210> 399 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 399 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 6 0 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 12 0 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 18 0 

tgtgctgaca tgcataagat gctgtttgag gaacgaaatc attttgccga gatagagaca 24 0 

gagttacaag ctgagctggt cacaatggag caacagcacc aagagaaggt gctgtacctt 3 00 

<210> 400 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 400 

ggctagcgat ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcaccgagt 6 0 

ttctggagtt cgaatttcgg ccggacggaa agcttagata tgccaacaac agcaattaca 12 0 

aaaatgatgt gatgatcaga aaagaggctt atgtgcacaa gagtgtaatg gaagaactga 180 

agagaattat tgatgacagt gaaattacaa aagaagatga tgctttgtgg cctceccctg 24 0 

atagggttgg ccgacaggag cttgaaattg taattggaga tgagcacata tcttttacca 3 00 

<210> 401 
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<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 401 

acccccttca tggacagatc ccccacagcc tggggcagaa gaggcgtcga gggcgccaga 60 

agtggcggca gcagcagccg cagcagccaa agagaggcaa gagaaagaga aagcgggcgg 12 0 

tggaggggtc ccggaagagc tggtccccgt ggttgagctg gtccccgtgg ttgaattgga 180 

agaggccata gccccaggct cagaggccca gggcgctggg tctggtgggg acgcgggggt 24 0 

gcccccaatg gtgcagctgc agcagtcacc actagggggt gatggagagg aagggggcca 300 

<210> 402 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 402 

ggatcctttc cagacagaag accccttcaa atctgaccca tttaaaggag ctgacccctt 60 

caaaggcgac ccgttccaga atgacccctt tgcagaacag cagacaactt caacagatcc 120 

atttggaggg gaccctttca aagaaagtga cccattccgt ggctctgcca ctgacgactt 180 

cttcaagaaa cagacaaaga atgacccatt tacctcggat ccattcacga aaaacccttc 240 

cttaccttcg aagctcgacc cctttgaatc cagtgatccc ttttcatcct ccagtgtctc 300 

<210> 403 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 403 

aattccgttg ctgtcggaca gattgcccta gtacccaccc acctatcagg gttatgcaat 60 

ggaacatcct cgcccaagct cttggagaag gcaaagacaa ctttgtacag tgccctgttg 12 0 

aagcactcaa atgggaagaa aggaaatgtc tcatcctgga agaaatcctg gcctaccagc 180 

ctgatatatt gtgcctccaa gaggtggacc actattttga caccttccag ccactcctca 240 

gtagactagg ctatcaaggc acgtttttcc ccaaaccctg gtcaccttgt ctagatgtag 300 

<210> 404 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 404 

agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa cttaacgaga 60 

ctcgtgccaa acttgatgag ctttctgcta agcgagagac tagtggagaa aaatccagac 120 

aattaagaga tgctcagcag gatgcaagag ataaaatgga ggatatcgaa cgccaagtta 180 

gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt agtgctgaaa 24 0 

gataagagca gattaagcag aggactaagt tggagcttaa agccaaggat ttacaagatg 300 

<210> 405 
<211> 856 
<212> DNA 
<213> Homo sapiens 

96 • 



BNSDOCID: <WO 9333982A2_L> 




WO 99/33982 



PCT/US98/27610 



<400> 405 
tggtgccngt tcctattccg 
gcacgaggaa ggaggaccta 
gagttagatt tcagagtcca 
tgggtggtgg ttctatagag 



tgctntcgtn ctncnccagg ancnangcgt ntcgaattcg 
ggcacacaca tatggtggcc acacccagga gggtagtggg 
ggccctaggt tgggacccac tccaaataat ctcctcggtg 
ggataaatga ataataaaca ttgttaaaat atacgaaaaa 
aaaaaaaaan aaaaaaaana aaaaaaaaaa anaanaaaaa 
annanaaaaa aaaaaaaaaa aanncccctn cnccctaaaa 
tccannccnn ntntttaata nnctncttnt tgnntcttng 
ggcnntaana naaaatnttn nttttttttn ggntanaaat 
ttttttttcn aaaccctcct nttntanctc ncgtntcnaa 
nttnnntnnt nctntttcta tttttnnttc ttntncaann 
nntgnggctt gtttnttttt ncnncctngc gtcatccnnc 
nannccnnat tttttntnnc ctctatntnn gnngngnnat 
atnantagtc ntntnttttn ttntccntng tnatannatt 
ctcannnnat ttnntnnncn ncgngntata tttnangnta 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
856 



60 



aaaaaaaaaa aaaaaaaaaa 



aaaananaaa aatnaaaaan 
nattcngggg ggntttttcc 
nctcaccnnt tcttttggtn 
ncnntnncng ttttttntnn 
aaanntnttt ntccncnncn 
ttccnangtg nnnngngtnt 
caataatttc ttnncncccc 
atnantcccc tttattnttn 
ttntntcccn ntntaanttc 
nntcnncggg gttnct 



<210> 406 
<211> 843 
<212> DNA 

<213> Homo sapiens 
<400> 406 

tnntnnnnnc gnangctggn nnnttctncc cntttctaat ngttnctaat actanggatn 6 0 

gtcacgaggn tcccangtag gcatagcgca ctgctgtacg tttttggttg tttttaagaa 12 0 

actcgatgaa gaggggtgtc attctgggct cggggtggtt gccaattttt caccagaaag 180 

ggagccaccc cttgcaacca cttctgtctc cgttagcccc ccctctgccc tcctccaagc 240 

caaagcgtgg cctggctttt gtcttcccat ttagttttcc tcttttaccc ttccttttgt 300 

gcttaattta ttaaaatagt tgctgtataa tttattttca taaactataa aaaaatacta 360 

aatggttaaa atagacttgc aggccaatct taaatggggt gggaggggtc tgagggtggg 42 0 

atggggaaag ggaaagaggt tttgatntaa acaaaacaaa tgcactttgg gtgtgtnnng 4 80 

gnattttnct ggggatanan ggggtggggg nnnnngnann nnnnnnnnnn nnnnnnnnnn 54 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 72 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 78 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 84 0 



<210> 407 
<211> 743 
<212> DNA 

<213> Homo sapiens 
<400> 407 

tgggcnggnn ctantngngg gctctcgact tcntacganc ttccaatggt tnggnggcac 6 0 

gagcccccat cttcactggt tattccactt atttaaaatg tccagaataa gcaaatctcc 120 
atatagagga agtagattag tggttgcttc gggatgggag gaatgggaag attgaggtct 18 0 

ttcttttgca gtgataaaaa tgtcctaaaa ttgactgtag cgatggtcac acaactctga 240 
atatgcttaa gaccattgaa ttacacactt tacgttggtg aattgtatgg tatgtaaatt 300 
atagttcaat aacatagtta caaaagataa tcaaaagcat gaaagcactg ttgatgtggt 360 
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ttggatctgt gtcctcaccg agtctcatgt tgaaatgtaa gccccctggt gggaggcgat 420 

gggattatgg ggcagagtcc tcacaaacgg tttacaccac ccgctcagtg ctggtctcct 480 

gatattgagt cctcatcaca tctggttgct tcaaagtgtg tggtgcctcc cctctatctc 540 

cctnctgctc tggccatata agatgtgcct gcttctcttc gccttctaac atgattgnaa 600 

gtttcctgag gcctncctag aacaaaactg ctgtgctttc tgnncccatc tacaggaccc 660 

ggagccaatt naaccccttt tctttataaa aaaaaannnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnt 743 

<210> 408 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 408 

tgtccgnttc ttangntgng ctctngcttc ctaggagtnt cnaatcgctt ggtgcagacc 60 

tccagcacaa gcctctgcta gtngatctca cggtagaaga aggtcaaaga ttaaaggtna 12 0 

tttttggttc acacactggt ttccatgtaa tngatgttga ttcaggaaac tcttatgata 180 

tctacatacc atctcatatt cagggcaata tcactcctca tgctattgtc atcttgccta 240 

aaacagatgg aatggaaatg cttgtttgct atgaggatga gggggtgtat gtaaacacct 3 00 

atggccggat aactaaggat gtggtgctcc aatggggaga aatgcccacg tctgtggcct 360 

acattcattc caatcagata atggctgggg cgagaaagct attgagatcc ggcagtggaa 420 

caggacattt ggatggagta ttnatgcata agcgagctca aaggttaaag tttctatgtg 48 0 

aaagaaatga taaggnattt tttgcattcc gtgcgatctg gaggaagtag cccaagtgtt 54 0 

tttcatgacc ctcaacagaa attccatgat gaacctggta accagaagaa ccccttggca 6 00 

cttatcttca tggcgttatt ctaatttaaa aagaacataa ctcatgngga cttatgccca 660 

gtctagaggc agaatcagaa ggcttgggtg gaacatatcg ntttcctttt tcctttcctt 720 

cggccctncc agnccagtcc atnttt 746 

<210> 409 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 409 

ggatccggtt tccaatgctc gggcnctcga gctncctaag annttgctaa tgcttggngg 6 0 

ngtgcctcgt gtgcgtggat tcgtgtgtgt gtgtgtgtct ngtatatgtg tgcgcagagt 12 0 

gcatcatttt cagactctac tatttccgtc aagtttctgt ttgatttgga tcatctcagg 180 

atcggattct gttttagagt gtttctgggc caggatccgg gcccctgccc tcctctgcac 24 0 

ctgaccacac tccctactca gggctagtct gttcttcccg gacatcttct ggtagccgtg 30 0 

caggagaggg ctgggtgggg cagagccagg aggggacctg gtgtgtcacc tgcccaccac 36 0 

ctggctcatc cctcangccc accctgaccc tacattacat aggttacgtc agcctactgt 420 

ggctgttgag caaagcattt ctcctttctn gggcctcatt gcactagatg ggcctgtggt 480 

cccaaagtag gtcagtaggt tggggttgct gacacccctt gggtgcaact ttgggacaag 540 

atgaantggc tctgtcctgt cactggcctc tccttgcctg ggggctatgt gcacttcaaa 600 

accctggcca agctcaagcc catgaagnat tggagaacac cctgggcccc caagaactgg 66 0 

angcacccgg ccanttcccc tgggattcca nctttgccan ggtgaaccct tcttttaccc 720 

naaacttntg tccccctgnt tccacttcca aaaanaactg g 761 

<210> 410 
<211> 748 
<212> DNA 
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<213> Homo sapiens 
<400> 410 

gatgccggtt cctatgatgn gctctcggct tcctaggagt tccaanactn ggctngcncg 60 

aggncttnta aatatatctn ggntttanta ggtgataagt nctgtcantt agtancatct 12 0 

gaaaaancag ctttgtcctg ggtgaaaaag gatgccaaaa ttgcctggaa aagagcagtg 180 

anaggagtcc gggagatgtg tgatgcntgt gaagcancat tgtttancat tcactgggtc 240 

tgccaaaaat gtggatttgt ggtctgctta gattgttnca aggcaaagga aaggaagagt 3 00 

tctagagata aagaactata tgcttggatg aagtgtgtga agggacagcc tcatgatcac 360 

aaacntttaa tgccaaccca aattatacct ggttctgttt tgacagatct tctagatgcc 420 

atgcacactc ttagggaaaa atatggtatt aaatcccatt gncattgtct aacaaacaga 480 

atttacaagt tggaaatttt cctncatgaa tggtgtatct caagtttaca gaatgtctta 54 0 

atcacagtat aaaattctct gngcatgcct gagtctcagc gccaaaatcc tcctccgaag 600 

tctgagaaaa atggtggcag cnnccccana aagtgatgtt nggcnccaga ttaccaggtt 66 0 

aacttcctcc agaatnccag tcaccactgn actggntagc anatcttgcc gagccaaaaa 72 0 

gcccnaagng ggaaaaaaaa aaaaaaaa 748 

<210> 411 
<211> 773 
<212> DNA 

<213> Homo sapiens 
<400> 411 

gnangnnngn ttcntagtgc ccgtgggagt cttagatncc ctaaaaaatt gntaatgctn 6 0 

ggtcggcacg agtcaaggcc tacgaaacag gtgatgcact accccggcta cggttccccc 12 0 

atgcctggca gctnggccat gggcccggtc acgaacaaaa cgggcctgga cgcctcgccc 180 

ntggccgcag atacctccta ctaccagggg gtgtactccc ggcccattat gaactcctct 240 

taagaagacg acggcttcag gcccggctaa ctttggcacc ccggatcgag gacaagtgag 300 

agagcaagtg ggggtcgaga ctttggggag acggtgttgc agagacgcaa gggagaagaa 360 

atccataaca cccccacccc aacaccccca agacagcaat cttcttcacc cgcttgcaac 42 0 

ccgttccgtc ccaaacagag ggccacacag ataccccacg ttctatataa ggaggaaacc 480 

gggaaaagaa tataaagtta aaaaaaaagc ctccggtttc cactactgng tagacttcct 54 0 

gcttcttcaa cacctgcaga ttctgatttt tttgttgttg gttgttctct ccattgctgn 600 

tggtgcangg aagtcttact taaaaaaaaa aaaattttgn gagtgactcg gtgtaaaacc 660 

atgttanttt taacagaacc nanaagggtt gncctattgg ttaaaaaaaa aaaaaaaaaa 720 

aaacttngng cctttagaac tattanngag nccnatttac nttaatccan net 773 

<210> 412 
<211> 774 
<212> DNA 

< 2 1 3 > Homo s ap i en s 
<400> 412 

gnanncegga ttcntagcgn tcgtggaagt geateggctg ntaacaattt getaatgett 60 

ggagttccaa ttccagagaa aatgagtgaa tgggcacctc gacctccccc agaaatttgt 12 0 

ccgagatgtc atgggttcaa gtgctggggc cggcagtgga gagttccacg tgtacagaca 180 

tctgcgccgg agagaatatc agegacagga ctacatggat gccatggctg agaagcaaaa 240 

attggatgea gagtttcaga aaagactgga aaagaataaa attgetgeag aggagcagac 300 

cgcaaagcgc eggaagaage gecagaagtt aaaagagaag aaattactgg caaagaagat 36 0 

gaaacttgaa cagaagaaac aagaaggacc cggtcagccc aaggagcagg ggtccagcag 420 

etctgeggag gcatctggaa cagaggagga ggaggaagtg cccagtttca ccatggggcg 480 
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atgacaatgt ttgccacagc ctctgcctgg aacctggctc gtgctgtgac cagaagggaa 54 0 

aggcggctgt ttggctcttt cttccccgca aggaccccgc ttacccgctg gatggagagc 600 

aaaggagacc cccttccgag cccgntcaca gtcctgtatt tggcaaggtt tgggaacctg 660 

aaggggccaa tntnccttga cacttanang cacttgcctt tcagacacca ttccgngcnt 72 0 

ctggtaaaag gggacaagaa aagccttaac cttggcnnca tattttgaca gggg 774 

<210> 413 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 413 

gnngnnnnnn tttctaatgc ttgggnnnnn ngtcnatgcn taagagccan gcggntcgaa 60 

ttcggcacga ggcgggcccg gccagcggaa gcccctgcgc ccgcgccatg tcaaagaaaa 12 0 

aaaggactga gtgcagaaga aaagagaact cgcntgatgg aaatattttc tgaaacaaaa 18 0 

gatgtatttc anttaaaaga cttggagaag attgctccca aagagaaagg ctttactgct 240 

atgtcagtaa aagaagtcct tcaaagctta gttgatgatg gtatggttga ctgtgagagg 3 00 

atcggaactt ctaattatta ttgggctttt ccaagtaaag ctcttcatgc aaggaaacat 360 

aagttggagg ttctggaatc tcagttgtct gagggaagtc aaaagcatgc aagcctacag 42 0 

aaaagcattg agaaagctaa aattggccga tgttgaaacg gaagagcgac caggcttagc 480 

aaaagacttt cttcacttcg agaccaaang ggaacagcta aaggcagaag tagaaaaaat 540 

ncaaagactg tgatcccgca agttgtngga agaaatcgcc aagcaaatna agtagcccaa 600 

ggaactgctt acagatggac tgattacata ttcgcaataa aatcttnggc ccaaagaaaa 66 0 

atttngggtt tgaaggaaaa ttaaattggt tngaaccttt tggaatttcc cgaaagactt 720 

ttgcctncnt ngacttaaaa tatttccatg gnggtgaaag gttgtccaan ctt 773 

<210> 414 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 414 

gnagnnnnnn nttctaatgc ttggggnnnn nngtcaatnc ctnngancna ggcggntcgc 6 0 

tcatccagaa angtcagatc ancaaagaag tccangaaaa antgcgaccc agctngaccn 12 0 

tttgatccca ggcttagcac acgattgcat ggcntcccct ttagccactt naaccactgc 18 0 

agacntccag gaagctggac t etc tec tea gtccntccag acttctggcc accacagant 24 0 

gaaaacccca ttttcaactg agctatcttt gctccagcct gatactccag actgtgctgg 300 

agatagtcat accccactgg ctttttcctt caccgaggac ttggaaagtt cttgtttgct 360 

agaccgaaag gaagaaaaag gggattctgc caggaaatgg gaatggcttc atgagtctaa 42 0 

gaagactatc agagtatgga gaaacacacc aaactacctg gggacaaatg ctgtcagccc 480 

ttaggcaaga ctaaattgga aagaaaggtg tetgecaaag aaaacaggca ggcccctgtc 54 0 

ctccttcaaa catacaggga atcctggaat ggagaaaaca tagaatcagt gaaacaaacc 600 

cgtagtccag ttctgngttt tcctgggata tgaaaagaat gaccanggac tnctggagtc 66 0 

aacttttcac ttgaagaatc tcaaggccac eggtcattgg ccacacactn gaactccttt 720 

ttaagatgta cccattactg gaattgggct taggg 755 



<210> 415 
<211> 852 
<212> DNA 
<213> Homo sapiens 
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<400> 415 



gnagnaannn ttctaatgct 
ttcggcacga ggtcacaggc 
ccnttcgtag gatggtgagt 
ctttaccgtg cgcnttggag 
tgagccagct ggactgggct 
gagaagacac tcaggcagca 
ccaccatagt caaggctgca 
ttccattcgc ttcaccgggg 
tgcccatctt naaggcagnc 
nttttttttt nctttnaaag 
gaattgggnc ccaaagggcc 
acctgggaag aagaaaatca 
aatggaaaac ctnaaaaaaa 
agtcnnatta ccnttanatc 
cnnaacttng at 



tgggnnnnnn ngtcaaacct tannaacctg gcntgncgaa 
aggtttantg gccagtttaa aacttatgct ntctgcgggg 
gtttccctgg gctttgctca tcacttcggg acatcgtgga 
tgtgtgatgg tgcctgagta gatctgctgg cagagtagtt 
ggccgcctgc cgcttcttga gggtggaaga ggggtgctct 
gactctgcct ctcactagga ggtgcccccc cgaccccgtt 
ggctgccccg ggagaagtgg ctccccttct tgcgcctgtc 
gganaagacn ttgggcttgg ttggcacagc ntgacccttc 
ccggaantgg gaaaaatatt tctttaaatg gtggcctttn 
gggttgaagt tccannaatg natttcccaa tttccttccc 
ccaatggggc antcggtcct ttaaaaagna acctttttgg 
cccagattgt tgggaaatat tttggncatt aaaataaant 
aaaaaaaaaa aaaactcgag cccnttaaaa acttttagtg 
canacnttga tangaanctt tggataattt tgggncaaac 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
852 



60 



<210> 416 
<211> 754 
<212> DNA 
<213 > Homo sapiens 

<400> 416 

ggnnnnnnng tnaaaccttc cnaannaggc tnggcgtcac tgnccccggt caacaaaccc 6 0 

acttttatga cagttttctt ccgcagcttg gctnttaaat tttactggca ggtgtatggt 120 

tgttggaggg ttcctagtga gttgggggac ctggcantan agctgcttgg ttggaggaag 180 

tgaanctggc ttantaccag cagctgatct cttccacgtg ctgctgcttt ttttgccact 240 

ctgatactaa accagagaaa gctgcaggtg gataaagaag ctgtggctgt tttttgcttt 300 

tgggtggcaa tgagaaagag tcacagtgtg ggttaaaggg atctgcagtg gggccaagga 36 0 

tgccacccca ccctcagctg tangcaagct tgcacataaa taacccccgt cagtggagtg 42 0 

ttcgggatgc agggggcant atagtgttct tggactttgt ccgtcctggg gcagttttta 480 

agttctttat atttaagtgg ggtcagtgcc aagtgctacc actttcccaa taaangaatg 540 

ggggacccan aaggctgggg tccctggcta ccttgttatg aaggttttgn tntttctctg 60 0 

acaaganttg ctttggaaag ancctgtttt taggggatta ttttttgnat accccgatgg 66 0 

gganccaggg ttctnctcaa aacccttaca acccttagga tcatagggaa aaggggcccn 72 0 

tnttttnctg ctggcttncc caacttaaaa acnt 754 

<210> 417 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 417 

ngtntatagc ttnctaatgc ttcntancga attcggancg agagaagccn tgagcagcaa 6 0 

agtctntcgc gacaccctgt acgaggcggt gcgggaagtc ctgcacggga nccagcgcaa 12 0 

gcgccgcaag ttcctggaaa cggtggagtt gcagatcagc ttgaagaact ntgatcccca 18 0 

naaggacaag cgcttttcgg gcaccgtcag gcttaagtcc actccccgcc ctaagttctc 24 0 

tgtgtgtgtc ctgggggacc agcagcactg tgacgaggct aaggccgtgg atatccccca 300 

catggacatc gaggcgctga aaaaactcaa caggaataaa aactggtcaa gaagcttggc 36 0 

caagaagtat gatgcgtttt tggcctcaga gtcttttgat caagcagatt ccacgaatcc 420 

tcggcccagg tttaaataag gcaggaaagt tccctttcct gtnacacaca acgaaacatg 480 
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gtggccaaag tggatgangt gaagtncaca atcaagttnc aaatgaagaa ggtgttatgt 54 0 

ctggctgtan cttgttggtc acgttgaaga tgacnngacg atgaancttg gggtataaca 6 00 

ttcacctggc tgtcaacttc ttggnggtca attgcntcaa agaaaaaact tgggcagaaa 660 

tgttccnggc cttatnttnt caagaaccnc catggggcna agccccaacg ccctttnttt 720 

aaaggcncat ttggaattaa attcntnttt ncccg 755 



<210> 418 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<4O0> 418 

tggggnntnn nttctaatgc tgggatgttc taaangntgg gctactcgtt ctttccgcag 60 

gancccntcg attcgaattc ggcacgagga aaggtggcgc gcttctcacg gctgagttgc 12 0 

tgcgcctgca gacggaagct ccccacaggc agagctgctt ggatgtgtga gtcatgaagc 180 

cagagaagcc ccgctccatg agcagtgact ccccaggccc tgtgacctcc ctcctgtctt 24 0 

gcagctcctc ctggcaccag tccccagggc tctcctgttg gtagttcctg cttttcttct 300 

tggaaattcc tcgtggacct cgagatcttt accctaaaat agttctgttg aatttcaccc 360 

tggcaatgta aattgatagc ttatcttcac agatgccaga caatggacaa ctcaccatca 420 

gtcctctgct cacctgagac aaatgcatgt ctgattgctt cctctgccct attgnttatg 480 

tgaaaatgca gattcactga gccagactaa ggcatcagtg actgttcctc tactgcctct 540 

cacatggaga ttgtgtattc agtgaaaggc tgatcaaaga ccccaaagga atgcaccagt 6 00 

ttatctctta tctacctatg acctgcgagc tgnccaccac ccccagttgt tgcgcctttc 660 

cagacagaac cagtgtcatc ttacacgtat taattggatg tcctgngnct tccttaatat 720 

gtatcaaaac aagctngcct tgaacacctt gggcacn 757 



<210> 419 
<211> 738 
<212> DNA 
<213> Homo sapiens 



<400> 419 

gnnngncgtt cnaattncgn ggnntctttc tngccnanna nnannngcgt gngngaattc 6 0 

ggcacgagac tgttcatcct aagttccact ataaacaggc tcatgactcg ggcacagaca 120 

cttcttgcgt gactttttcc tatgatggta atgtccttgc ctctcgtgga ggtgacgatt 180 

cattaaaatt atgggacatc cgacaattta ataaaccact tttttcagcc tcgggtcttc 240 

ccaccatgtt cccaatgact gactgctgtt tcagtccaga tgataagctc atagtcactg 3 00 

gtacatctat tcaaagagga tgtggcagcg gcaaacttgt tttctttgag cgtaggactt 360 

tccaaagggt gtatgaaata gacatcacag atgcgagtgt tgttcgctgc ctgtggcatc 420 

caaagctgaa ccagatcatg gttggaactg gaaatggatt ggctaaagtc tattacgacc 480 

ccaacaagag tcagagggga gcaaaattat gtgtggttaa aacccancgg aaggcaaaac 540 

aagctgagac tctactcagg actacatcat cacccctcat gccttgccta tgttcccgtg 600 

agccccgnca acggagtaca aaggaaacag ctggagaagg acagactgga tccctgaagt 660 

cgcattaacc tgaacctcct gtancangcc cangtcgtgg tggccgattt ggaacccacg 720 

ggggcactnt tttttcct 738 



<210> 420 
<211> 739 
<212> DNA 
<213> Homo sapiens 
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<400> 420 

gcgntnntat tagcgtgggc tcgntctcgc tcnacncanc nngngctggn cgaattcggt 6 0 

acgagaatca gaggaggctt cttcatcctt caactccatg atgaactcct atatgaagtg 12 0 

gcagaagaag atgttgttca ggtagctcag attgtcaaga atgaaatgga aagtgctgta 18 0 

aaactgtctg tgaaattgaa agtgaaagtg aaaataggcg ccagctgggg agagctaaag 24 0 

gactttgatg tgtaactgtg ctgttgatga agtcctccca gggaagcctg tgcagatgca 300 

gtcacctgga aagaacagag attccctttc acctacctca gcaaaacaaa ctttcaagtc 360 

ttgatagact tagcctagta attttatagt gagagtttca aactatatat caagtgtcta 42 0 

tagcatcaaa aacttctggg ggcgtggggg aaagtagaat accaagtata atagttacat 48 0 

tcactttcaa agagcatcta tgaatttgcc ttttgtaact tactgtggct ttaaacatat 540 

tcagaacaga tgcttgaaat atgcacttag cactttggtt ccacatctgt ctgggtaaac 600 

catgaagaaa atgaagctgc tgcctcaatc gancccagac agcagccata ggcagataaa 660 

gatttnggtt cacccttggt ggtgggaggc atcgtgtgtg cctttttttc ctctaatatc 72 0 

aattttacag tccgggaan 739 

<210> 421 
<211> 727 
<212> DNA 

<213> Homo sapiens 
<400> 421 

gtgatctttn tgagtggggg ccntnctngc tctannanat aggttnggng ggctagcgat 60 

ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcacgagtt tctggagttc - 120 

gaatttcggc ccggacggaa agcttagata tgccaacaac agcaattaca aaaatgatgt 18 0 

gatgatcaga aaagagctta tgtgcacaag agtgtaatgg aagaactgaa gagaattatt 24 0 

gatgacagtg aaattacaaa agaagatgat gctttgtggc ctcccctgat agggttggcc 3 00 

gacaggagct tgaaattgta attggagatg agcacatatc ttttaccaca tcaaaaatag 360 

gttctcttat tgatgtaaat caagtcaaag gatcctgaag gccttcgagt attttactat 420 

ttggtacaag acttgaaatg tttagttttc agtcttattg gattacactt caagattaaa 480 

ccaatttaaa ttgtatgttt tcaagctggt tgnatattta attaaaggga tgggaagggg 54 0 

ttatttgtca tttacagtat tggggtttta tgaatgtgaa gcaaccaaaa aaaattnnaa 600 

tgtaaaactg gaaaatagga aaattcatta ncagcttaat gggtatcctt acttgatncn 660 

ctgggtttgg aagtccccac acacattaaa tctgtaatga aancnctttt ggttaaaatt 720 

tctctat 727 

<210> 422 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 422 

gtntngnnng nngttnnatt atatggntcg nctnnctcna nnancnangc ttgngctgac 60 

aacttgattg ggttctcctt caggtttgaa gcgccctcna gaagtgtcta aaggagacag 12 0 

ttgatagcca aacaacagtt ttggattcac tgactgatta tgaaagaagc agtagactgg 180 

tatcaagaat cagtcagcaa ggaggccctc accagacgcc agtgccatgt tcttggactt 240 

ctcagcctcc atattcatga actaagtttt tggaatcctt aggcttccac gtgtggaaag 3 00 

cctgagctaa cctactggag gatgagccat cacctggagc agattcaggc catcctagtt 360 

gaagcctccc taggccaagc aaccgtccaa ctaccagaca ttgaccattc agccttgaac 420 

attcagcaca aagacaaaac agaccagacc agaagagtcc cacagaatag gggaaactat 48 0 

tcagagaaaa cttaagccac taagttttat ggtgttttgt tcttgtagcc agaagcatag 540 

gcatactggc caatacaaac cgaaatcctt ctaacgtant ggaccctttt caggccagca 600 
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ttttttccct tgaaaacctg ggagccttgt attccatctt attagcagaa gatcactttc 66 0 

accaatggtt tgggctcttg atttggaatt gatgatgtaa tgagcctnta ttcnanatgn 72 0 

gacttaatac ctctgcgaat tgactggatt ccn 753 

<210> 423 
<211> 844 
<212> DNA 

<213> Homo sapiens 
<400> 423 

nggnnnttnn nnnnnatncc ntgatcgtgt ntcgttcttt ctncaggatn nnntcgtttc 6 0 

gaattcggca cgaggaaaag ggagccgcgc agngcctacg ggagtncggc ggcagcagcc 12 0 

ggtaccggca accacgggca gctctcaggg aatctccgtc gttgaggcca naggctccag 18 0 

tccccgcgag tccagatgcc tgtccagcct ccaagcaaag acacagaaga gatggaagca 24 0 

gagggtgatt ctgctgctga gatgaatggg gaggaggaag agagtgagga ggagcgganc 30 0 

ggcagccaga cagagtcaga agaggagagc tccgagatgg atgatgagga ctatgagcga 36 0 

cgccgcancn agtgtttcag tnagatgctg gacctggaga agcagttctc ggaagctaaa 42 0 

nggagaagtt gttcaaggga acgacttgan tcanctgccg gnttgcggct tggaaggaaa 48 0 

ntgggggggc ttgaanaaga agcccctgga atnccaccgg aagccccctt ttgggggggg 54 0 

gccttgcaaa ccgggaancc ctttnaaagg aatttcngcc antttcaang gttgggccaa 600 

ggggaatcnt accnaagggg ccttctnggc cttggnatgg tgaatccang gnaaattaag 66 0 

gtncccaatt gntgaancct tccaanggga ancccaaacc agcacccttg naanaagttg 72 0 

agaaaacttg cttgcntctt ntgacacccc tncnaggggg aacttcaagg aaccggttcc 78 0 

tnaggcttgg aaggaggacc cccananccc tggancctaa attnttaaat gggtnggacc 84 0 

accn ^ " ^ 844 

<210> 424 
<211> 799 
<212> DNA 

<213> Homo sapiens 
<400> 424 

ggagnnnngn ntccnaattn nntgggnnnn nnngtcaaan nctngctact cgttctttcc 6 0 

gcaggatccc atgcgattcg aattcggcac gagcccagac ctatggagtc agacagtagg 120 

tttgaggccc agcaatctat ggtttaacaa gccatccagg tgtttctgat gcacagtgaa 18 0 

attggggtac cactggtatt aggtttggta tggcaacttt ttcatcactt gttttatgta 24 0 

gttgtctgat caattgtgaa aacataatga atgttggaaa tggaacagta aaataacgaa 300 

agccaacttt tttttttttt tttgagacgg agtcttgctc tgtcgcccag gctggagtgc 360 

agtggcgcga tctcggctca ctgcaagctc cgcctcctgg gttcacgcca ttctcctgcc 42 0 

tcagcctccc gagtagctgg gactacaggc gcccgncacc acgcccggct aattttttgn 480 

atttttagta gagacggggt ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 54 0 

gtgatccacc cgcctngggc ttccaaagtg ctgggattac aggcgtgagc caccgggccc 600 

gggccaaaag ccaactcttt atgcctagaa aatattgtgc accctatgac ccaagcccat 66 0 

tgaatttttn cngggaaatt tatggtaaat tattgaaatg gatggtacct ttaaaaagtt 720 

atttggcaca ttccccttgg gttacctttg gnatggtttg ccagggaatt naaaactttg 780 

ggntnaaacc ttttttann 799 

<210> 425 
<211> 750 
<212> DNA 

<213> Homo sapiens 
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<400> 425 

gangccggat tccaattntc nggctnctct naaannctgt ntaatgcttg gtccgcanga 60 

ncccatgcga ttcgtggagg tctcctttcg ccccagccca ggtggccaag cccatcctgg 12 0 

cctcagaaca tgctgagcac attttgtagg gtggcacctt tttatccaag ttactagcta 180 

cacatcagtg tttaaagaga aaaaagtgac ctttcatttt tttttcttga aacttgagga 240 

aacaagatac atactactga tttttttttt cttaaaacta aatgcatgac tgcagagcgg 300 

tagaggtgta tatttttcat actgtggggc aaagtatttg tgctgctttt tggagatgga 360 

ctggaacgtc tggtttctgt ccccgggccc ggcagctacg tctattttct gtagaaggtg 42 0 

ccacagtgag acctggagcc accccttcct gcctggcgcc gtttanagct gggagcccgt 480 

ggactccggc ctgtttctac cttctattca accactctga cgtggggaga caagaagaaa 54 0 

tagaactttt tgatagtgtg gtaaaaacat tggattttga actattttag taaaaggagt 600 

taccaacaag aatgtnatag gtgctacttt gagctagata aataaaggct ctttgtgagc 660 

ctcctgaaaa aaaaaaantt nnnnnnnnnn atnannnnnn annaaaaaaa ctggnccttt 72 0 

aaaactttan gggncgttta cctanaccct 750 

<210> 426 
<211> 819 
<212> DNA 

<213> Homo sapiens 
<400> 426 

gnagnncggn ttcttatgat cgtggctnct cntctanngg ttgtgtaatg ctnggtcnnc 6 0 

angannnnnt gcganncgaa ttcggcacga aggggggttc ccaatagtag aaaagggtcc 12 0 

ccattcctgc tcagcaccgc acctctctac ccccccacag acacacatgc agacacacac 180 

atgcagacaa cacgcagaca cacacatgca ggcactcaca tgcaggccca tgcacacaca 24 0 

cgtgcacaca catgcagaga catgcagaca cgcaggcaca catgcacaca tgcaaagaca 3 00 

cgcatgcagg cacacgcaga cgcacacaga gacacacatg cagatcacat gcacacacac 360 

atacacacac tggcccctgt ttttctgtgg tgtcactggg tgccagcaac tcggtatctn 420 

ccaccttcca ctaaaacctg ggccttaatt tctctcccgt ccccacccct aaattcctga 480 

tggatgaacc tagagctgtc ctgtccactc caggccggac tgacgtancc tatgggccca 54 0 

gcaggtccag ggcccacgtt ttaatttctt tttnaaaagc tttaggtctt ggccnggccg 600 

ccggtggttc acgccttggg agttcccagc atttttnggg aaggccnaag gccgggttgg 660 

attcacaaag gtcaagcaag tttcaaggaa ccaagccttg aaccaggcca ttgggtgagg 720 

aaccctgggc ttnttactng ggnaaattcc caaaaaaaaa ttggccttgg gccnaagggt 780 

gggcaagggc acccttgttg gggtccccaa antttacct 819 

<210> 427 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 427 

gagnrmgatt cnaattnctg ggctnctctc ttnntatnta atgctgggtc cgcangancc 6 0 

nntgcgattc gaattcggca cgaggtccaa ggacaacttc gagacatttc tttttgccac 120 

cgtatctaac agggagcagg aagatctctg ccgaggaatt gtccagctct gcttcaatga 18 0 

gcaaagccaa cagctgctag cagaggtcca gccctctgac tctttcctca tggtagagac 24 0 

aactgcatac tttgaggcct acaggcacgt cctggaagga ctccaggagg tccaggagga 3 00 

agatgttccc ttccagagga atatcgtgga gtgtaactct catgtgaagg agccaaggta 360 

cttgctaatg gggggcagat atgactttac ccccttaata gagaatcctt cagccactgg 42 0 

ggaatttcta agaaatgtcg agggtttgag acatcccaga attaatgtct tagatcctgg 480 

ccagtggccc tcaaaagaag ccctgaactg gatgactcca gatggaagcc ttgcagtttg 54 0 
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ctctcacaag ggaactggct attattcaag gaccttctgg aacaggcnaa acctatgtgg 
gtctnaaaaa ttgttcaagc ccttctacca acgagtcttg tttggcaaaa ttaaccttca 
gaaattccca tcttggttgn gtgtatacta atcatgcttt ggaccanttc tggaangctt 
ttccattgtc agaaaaccan atttggccgg 



600 
660 
720 
750 



<210> 428 
<211> 943 
<212> DNA 

<213> Homo sapiens 
<400> 428 

gnngnccggt ttcctattct cnggcanctc tcttcctncn acctattanc tggactctaa 60 

anaaaagnnt gnngcggttg gctcaagggc caccanaaca tttctttatt attattattt 12 0 

tttaacctgn acatgcntta aagggtctat tacctttctt tccgtctgtc tcaacagctg 180 

aaatggggcc nccaaggagt gccttccttt tgctccctcc tactgggact gacggntggn 24 0 

antgtntgnn cccanntggg ggtgtctcct gnctgggaag ganggaaagg gaggcanagt 3 00 

tttgccgggg ttgcanntng acancangct gnanaggana tggctaataa ctgtttaatg 36 0 

gaaacctgct tgggcttgga nggaacttag nctgaatttt cccgacttcc tctgccagtt 42 0 

attgacacan tctctttnta agacangaaa taaactaaac cccaccccaa ggnantnatn 48 0 

ncangcngaa aacnncncat ngcccacatt ncctnatccc ntancaccnn ctcnttnttt 54 0 

nncccaanac tncttcccan ntntcnccnt ttacccntan ncntnnttnt atcccnctaa 60 0 

tncctnannn ccntnnttnt ccnnatnctt acncncncnn ntnnnncccn nntctttnnn 66 0 

cccaaanctn ncctcncnnt tcnnctnaac cntntnnnca nnanacaccc ttctnatnnc 72 0 

ccannntctn cacnntnnnt ntctccnnnt nnncnccnnn ntcntnnnna nancntntnn 78 0 

nanancnatc tnntncncnn cnantnnnnt tcanttcacn ctctnnnnnn tntanccnat 84 0 

tnnccntcnn tnnccnntta nnncntnnnn nncnaantnn nnnnanctct ncnnncnnct 90 0 

carmtnnnnt nnnnnncnnt cnanaccntn nnnntctatn ccc 94 3 

<210> 429 
<211> 775 
<212> DNA 
<213> Homo sapiens 



<400> 
gnangnnnnn 
tcncncgcag 
gaaaagccca 
tacgcagatt 
gcacaccctc 
cattgtggca 
gtcttgttct 
tcccatcctt 
nnnnnnnnnt 
tgccgngttt 
atgtttaana 
acccctnagg 
ctgcnanncc 



429 
nntttctaan 
naangctgtg 
agatgactac 
tgcccatctc 
caacctgggc 
ctgttgcccc 
cttgtctgtg 
acactctgga 
ttttttgagc 
tttaccttgg 
cctggnattc 
attctcatcc 
cngccttttg 



tncttgggnn 
gcgctccatt 
aggatattag 
ttgaatcaaa 
agggactggg 
ccatggcggc 
gtctgtttaa 
tcattgtgca 
tcaaagaaag 
atttaaaggc 
ancttcataa 
tttacanntn 
ncccaagctt 



nnnngtcann 
gtgaaagatc 
tgcatgcacc 
gccagcaaga 
gggatgcagt 
atgggcaaga 
tatgtgggtc 
gacttaatca 
ttctcatttt 
accttangtt 
gaccaaagan 
gactttttcc 
ttntgntngn 



gattnngcta 
caggcatttt 
ccaccctcct 
cttctctgct 
gtgtgttagt 
tgaccttcca 
actagggtat 
gggttttaac 
ccctattcaa 
ggggcaacag 
ggagctttcc 
aggccaattt 
ccccccattt 



aaggttngga 
tccgagccag 
ctcagtgtgg 
gctgtgatct 
gcccatgtgg 
ttagcttcaa 
ttattctttc 
gctttcattn 
ctaataccca 
attctcactc 
ctttctcttt 
cccatnnaat 
acccn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
775 



<210> 430 
<211> 763 
<212> DNA 
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<213> Homo sapiens 



<400> 430 



ngggtgnnnn nntttctaat 
tcgttctttc tccangcagn 
tgaaaaatca tccaagacag 
tgtgtttaca gacgtgtgca 
gatcgttttt tcccagggca 
attccccttg cctaaataca 
caatgatgat gatttcagca 
cacaaaccca agggcagggg 
cctctgggga gctcangaag 
gtagacaagt atggctcctc 
ttagataaga caccccttaa 
aggtccccaa atgagaagtc 
ntgacagcag tcagggacag 



nctggggnnc nntnnncnnn ntttcctaat ncttaggngc 
nnngcgtttc gcgacagctc tccaatactc aggttaatgc 
ttattgcaag agtttaattt ttgaaaactg gctactgctc 
gttgtaggca tgtagctaca ggacattttt aagggcccag 
agcagaagag aaaatgttgt atatgtcttt tacccggcac 
agggctggag tctgcacggg acctattaga gtattttcca 
gggatgacgt catcatcaca ttcagggcta ttttttcccc 
ccactcttag ctaaatccct ccccgtgact gcaatagaac 
gggtgtgctg agttctataa tataagctgc catatatttt 
cgtatctcct cttcctagga gaggagtgtg aacaaggagc 
acccattccc ttttccagga gacctaccct tcacaggcac 
tgctacctca tttctcatct ttttactaaa ctcaaangca 
acattcattt cttnatacct tec 



360 
420 
480 
540 
600 
660 
720 
763 



300 



240 



60 
120 
180 



<210> 431 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 431 

tggtgttnnn ntcctaatgc ttggnngnnn ggtnannctt ctaattactt tggggctcgt 6 0 

tctntctcna ennngenngg cgttncgaat teggcacgag ettgaagege tggtttttct 120 

cgaagcaatc cttattatat tgttaaacaa ggaaagatca accagatggc aacagcacca 180 

gattctcaga gattaaagct attaagagaa gtagctggta ctagagtgta tgacgaacga 240 

aaggaagaaa gcatctcctt aatgaaagaa acagagggca aacgggaaaa aatcaatgag 300 

ttgttaaaat acattgaaga gagattacat actctagagg aagaaaagga agaactagct 360 

cagtatcaga agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa 420 

cttaacgaga ctcgtgccaa acttgatgag etttctgeta agegagagae tagtggagaa 48 0 

aaatccagac aattaagaga tgctcancag gatgeaagag ataaaatgga ggatatcgaa 54 0 

cgccaagtta gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt 600 

aatgctgaaa gacaagaacn gattaagcag aggactaant tggagcttaa ageccaagat 660 

ttacaagatg aactaccggc aatagtgaac aaaggaaacc gttttttaaa agaaangcen 72 0 

aanctgcttg aaaaaaaaaa aaaaaaactc ggectntaan t 761 

<210> 432 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 432 



gnngantnng tcttattatc gtggngctct nactnnctct aaatanaatt gtgttgnggg 
aatteggcac gaggccaccg aagcttcagg atgacatctt agactctctt ggtcagggga 
tcaatgagtt aaagactgea gaacaaatca acgagcatgt ttcaggcccc tttgtgcagt 
tctttgtcaa gattgtgggc cattatgett cctatatcaa gegggaagea aatgggcaag 
gccacttcca agaaagatcc ttctgtaagg ctctgacctc caagaccaac cgccgatttg 
tgaagaagtt tgtgaagaca cagctcttct cacttttcat ccaggaagcc gagaagagca 
agaatcctcc tgcaggctat ttccaacaga aaatacttga atatgaggaa cagaagaaac 
agaagaaacc aagggaaaaa actgtgaaat aagagctgtg gtgaataaga atgactagag 



480 



300 
360 
420 



120 
180 



240 



60 
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<210> 435 
<211> 755 
<212> DNA 
<213> Homo sapiens 



ctacacacca tttctggact tcagcccctg ccagtgtggc aggatcagca aaactgtcag 54 0 

cttccaaaat ccatatcctc actctgagtc ttggtatcca ggtatttgtt tcaaactggt 600 

gtctgagatt tggatccctg gnattggatt tcttaaggac ttttggangg ctcttgacac 66 0 

catgcttcac agaacttggg cttcanaagc ttcanttttt tgcanaggtg ccccaggtta 720 

ggaaaacagt tntncttgtt ttgtannt 748 

<210> 433 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 433 

gggnaaaagt ttnnnannng ggnagnnnng ntnnaccntt cctattactt tggagctcga 60 

actcgcncca canannnagt gncntgngct gttttgcaga tgaggaaaac tgaggtacag 12 0 

aattcttagg gaacttaccc aaaatggctt ttctgcactc tgccctttgg tattgtccca 180 

tgtgaattgt ttaaaactta tgtgtatagt ggcatgagta ggtgatttca gaaacagaac 24 0 

tcacttttgt tgtttggtct taaaattagg aacttttctt catctgggct tcatttccct 300 

gcaccttccc agctttctag tcatgcaagc cacatgtctc cacgtgaggg gttcattgga 360 

aagcagccac agagccaccc cctggctggg ttcttcccca gctctgcttc ctccttcccc 420 

aagtcctgca gctgctctct ccatggcaga accacttctc cccttactgg aggggaggtc 4 80 

cactgaacaa atccaggaga ggaatcattg tgttttccac agaagagaaa gtacactgga 54 0 

ctttctgtgc aacctgttac tacattttca caganactca tatttgtgca ntgtaactca 600 

atttgaaacc cagcaaaatt aggctcccgt gtctccataa aaggccacca tgatggtaac 66 0 

cgttggactt caccttgtgt ttnggacana ngctgattgg attttaccca tcatcacanc 720 

cgtgtcttac attctcnttt cctgggcttt ggacccctgn tanaaaaan 769 

<210> 434 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<400> 434 

ctanccttcc taaannctng gctactcgnt ctttctnnan ganncnnntg cgatncgaat 60 

tcggcacgag caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca 120 

gtgcaggaca caagctcatg cccccgtctt gccagtgaca cttgaagcct cccgacttcc 180 

acagagtgct tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt 240 

tttttgagat ggagtctcgt tctgttgccc aggctggagt gcagtggcct gatctcggct 3 00 

cactgcaacc tctgcctccc aggttcaagc gattcttctg cctcagcctc cagagtagct 360 

gggactatag acatgcacca ccacgcccgg ctaattttgt atttttggtc gagacggggt 420 

tttgccatgt tagtcangct ggtcttgaac tnctgacctc aagtgatcca ccactcggcc 480 

tccaaagtgt tgagatgaca ggcacgagcc accagcccaa cctgagtggt attttcttta 540 
gggaccangt agactttaaa acgagggtaa gagaaaaagc ccagtggtct tttctgangg 
taaataaatt tctgcccagg aaacnttncc aagccccaac cagcaagcca acccttaaaa 

aaaaaatcac ttcgtgttcc ccaangggan ctttnttaaa gctttggggg cttccaggna 720 

aaatcatttc cagtnnaant ttggaagaat tcannagnat ttnt 764 



600 
660 
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<400> 435 

gnnnnntttc taatgtggnn nngnnngnta annttctaaa ncttgggntc tcgttctttc 60 

tncagatncc ntcgattcga attcggcacg agggatcctt tccagacaga agaccccttc 120 

aaatctgacc catttaaagg agctgacccc ttcaaaggcg acccgttcca gaatgacccc 180 

tttgcagaac agcagacaac ttcaacagat ccatttggag gggacccttt caaagaaagt 24 0 

gacccattcc gtggctctgc cactgacgac ttcttcaaga aacagacaaa gaatgaccca 3 00 

tttacctcgg atccattcac gaaaaaccct tccttacctt cgaagctcga cccctttgaa 360 

tccagtgatc ccttttcatc ctccagtgtc tcctcaaaag gatcagatcc ctttggaacc 420 

ttagatccct tcggaagtgg gtccttcaat agtgctgaag gctttgccga cttcagccag 480 

atgtccaagg gtgcctgggg aagagccact gcgcatgtta tctttggtgt tactccagtg 54 0 

ttgaacanag agctggtcag aggcagtgca tcgcanagag acattaataa gggaatcctt 6 00 

tgaatcccta ancagcanca gctttnctga nggggccnat gatgccagtg acctnttcan 660 

ggnaagtctg ggacattggg accaccctgg ggggaagaac ttgtgggatg tggcttttct 72 0 

tttatgaata aagtactttg agttggttgn aatcn 755 

<210> 436 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 436 

aaggctggnn nnngnnntgc nnnncttcnt attantctgg gggctcgtnc tctctcnann 6 0 

nagnnaggcg ntgngaattc ggcacgagct caagaaaagg agaaagtttt tttgtatgaa 12 0 

attggaggaa atattgggga acgctgcctt gatgatgaca cttacatgaa ggatttatat 180 

cagcttaacc caaatgctga gtgggttata aagtcaaagc cattgtagaa gacttaacaa 240 

gctgcagata accatgtgga cttctgtcat aattcttgct gagtcaagag tgtaaataaa 300 

agaaatggca ggactcatat tattcagttg tcccaagtat ttaaaaatga ctctcttaag 360 

ccttaaaaag tcatagattt gtgctgctgc cagaattata ttaattatta ttaatggtat 420 

tattagaaaa aaaatttctg gagtgagagt naagangctt aattagtttg tgggcagttt 480 

tcatatgctc tgtgaaatgt gtccagatgt gacataagtt ttttttttta atatggngga 540 

aatgncttct ctttcccatt cttttctcct aaaaatcata tatactggga atatatgcct 600 

ctnttacctc tattaccctc ctcacattta ccctttccca gttnggtttt gctttttnac 660 

caaaaagatt ccaatnccna ggtattggca agttntnaaa accgcccntt aaacatccct 720 

aatttcncag nattccnnnc ttgccaaatn ttngtntcnn 760 

<210> 437 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 437 

ggnnnnngnn ngntnncgtt ccctattant caggngctcg ntctntctcn annnancnng 6 0 

gcgtgtncga attcggcacg aggattttcg aaactcttca gctacttgcc cttttttatc 120 

tgaaaccatc ataccttctg aaagaaaaaa gcatatcttc attgacataa cagaagtgag 18 0 

atggcccagt cttgatacag atggtccatg atatatatgg agagtggcat tgtgaagata 24 0 

acatctttag atggtcatgc atacctctgc ctgcccagat ctcagcatga atttacagta 300 

cattttttgt gtaaagttag ccagaagtca gactcatctg cagtgttgtc agaaacaaat 360 

aataaagccc caaaagataa actagttgaa aaaactggca aaatctgtat acgtggaaat 42 0 

ttaccaggac agagactgaa gaataaagaa aatgagtttc attgccagat catgaaatcc 480 
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aaagaaactt taaagaagat gagttgtgta aatggaactg aagggagggg aagaactgcc 54 0 

ttcgcctggt acaaagcaca catgtgtata cacatgggtc aagcagtgct ggtctgtggc 600 

tgnctgtcca gangaatgga aatatccttg gctttagcac ttcattttca taataaaatc 660 

agcaattntg tctaaaaaaa aaaannnana aaaaactnga gcctntanaa ctntagtgag 72 0 

tcgtattacg tagatncnna catgataa 748 

<210> 438 
<211> 823 
<212> DNA 
<213> Homo sapiens 

<400> 438 

taatccttnn tattgntcgg gtactngntc tntctcnaag annntntcgt tncgcccagg 60 

tagctgagac tacccacacc ttggtcccag ctacttggga ggctgaggtg ggaaaatcac 12 0 

tttgcccagg aattcaaggc cgcagtgagc tatgattgca ccactgcact ccaggcaaca 180 

gagtgagacc ctgtcttaaa aaaagaaggg agaaagtgtc agatggtgat gaggtctggg 240 

ggggaaatag agaatgggga tcaggagtgt ggatggtggt attccctcac caagaggtga 3 00 

catgttgagc agggaacttg ggaggtgagg gtgtgacccg tgtggaaatc agggaaaagc 36 0 

attncagcct gagggacagc caatgcanag gccgtgaggt ggccagtgcc actgagcagt 42 0 

gagcttggga tagggggcan gtgangaggc tggagagcgg ggtcagacaa accaatatgc 480 

ttatttaaaa caaggttgtt ncagcaccct tgccttaaag ccttgagcct gnaanctnga 540 

aaaatttggg cacnttcaaa agcanggang gaaaccaaaa gaagattggg agggaaaagc 6 00 

ccttncnttc ccttancagg aaatgaagtt nccacccttn aaaacaggnc caggaccttt 660 

ttgggaccct tttggccttt tggttcctta gaatcctctt ggtngcttnn gaatnaaaag 72 0 

gnaaaagggg cctttaaggg gggatcccat tntttccaaa attcaaaggg ggctttccct 780 

gggcttaccc aaaatttctt ggncttaant aaaaaaattt ntt 823 

<210> 439 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 439 

gnnnnngntt ctaatgctgg nnnnnnnngg taccctttcc aaaacctggg ctctcgntct 60 

ttctncangn agccnngcga ttcgtctgtc tggtgatttt tattttaagt gaacctttgg 120 

atctatcttt aactctcttt attgtgagtc taaattccaa ttctgcagca gatcagtaaa 180 

ctcacagtat ttttcctgtg gaaatctatt caataaggaa accaagacag gatantaaaa 24 0 

tttaaaaaaa ancaactttg aattcccctg cctaggtctt ccagttgttt tccagcgcat 300 

acctcaggta tgactttgct agccggggac aaaattagca ccttccgatt ctctagtcca 360 

aatgaacttt ggctaaataa aaaattatta tactacataa taaagttnca gatagcagga 420 

aatgcaagag ctaggagatt cctagattat atctggccaa gccaaatacc ttaaacatcc 480 

acctggaaat cctctacccc ctcttctgag ataatttgcc cagccctttc ttcccacaca 540 
ctcactcaat gtcaccccct tctaatcccc aaaactgttt ttgtggcctt ggtagcctat 
agtagtttct cacatctttt cccctanact tttctgtttt cagtttcaga ccaaaaaaac 

tcttcaactt ttttccagtg gggtcttcct taccagtaac tttaccactt gnaatcttat 720 

ttcattgaaa aaaccttaaa tgggntggga aaaggcttgc cnncann 767 

<210> 440 
<211> 752 
<212> DNA 
<213> Homo sapiens 



600 
660 
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<400> 440 

nagnnnnntt tctaatgctt ggnnnnnnnn tcnatgcttc caaaagcngg gngctcgttc 6 0 

tttctccaag atncnngcgn tncgaattcg gcacgaggat ggatgagact gttgctgagt 12 0 

tcatcaagag gaccatcttg aaaatcccca tgaatgaact gacaacaatc ctgaaggcct 180 

gggatttttt gtctgaaaat caactgcaga ctgtaaattt ccgacagaga aaggaatctg 24 0 

tagttcagca cttgatccat ctgtgtgagg aaaagcgtgc aagtatcagt gatgctgccc 300 

tgttagacat catttatatg caattcatca gcaccagaaa gtttgggatg tttttcagat 360 

gagtaaagga ccaggtgaag atgttgacct ttttgatatg aaacaattta aaaatcgttc 42 0 

aagaaaattc ttcagagagc attaaaaaat gtgacagtca gcttcagaga aactgaggag 480 

aatgcagtct ggattcgaat tcctggggaa cacagtacac aaagccaaac cagtacaaac 54 0 

ctcctacgtg gtgtctactc ccagactncg tacgccttca cgtnctcctn catgctgang 6 00 

cgcaatacac cgcttcttgg gtcangaatt agaagctact gggaaaatct accttccgac 660 

agaagagatc attttagatn taccgaatga anaaagcttg cattagtgac attgaaaggg 72 0 

aaataaaaat tcctacagtc naaaaaaaaa at 752 



<210> 441 
<211> 775 
<212> DNA 
<213> Homo sapiens 



<400> 441 

gnagccngat tccaaaacct gggnnccgat ccaatgcttn ccaattactt gggagctcnn 60 

actngcncna ncaanctngc cntgcgaatt cggcacgaga agnaggcgga gcttgcagtg 120 

agctgagatc gcgccactgc actccagcct gggcaacaga gtgagactct gtctcaaaaa 180 

aaaaaaaaaa aaatggaacg cagggcaaga actcgtnttt ggaaggagat gggggaaagg 24 0 

ancggtatta tacctatgtt gnatttgcag gcaaatgaga tgganccctc tctgtaaaga 3 00 

agagtcattt gtgcaagtag acggggtctg tgggtgcang ccctggaggg gcacacaatt 360 

gcctgnangc ttctgtgana tcgggagang gaggagaagc agtctcttga caaaataaag 420 

tatttttatt cattngtatt tattaaatga aaaaacaatc ccatggtgtc ccttgtgtgt 480 

ggtggaacct aatgactgtt gaaataaagt ctgngttttc ccttcaaaaa aaaaacncnn 540 

anaanaaaaa ctcgagccct ntaaaacctn tngngagtcc gnattaccnt anatcccnga 600 

cnttgataag gatccattga tnaantttgn cccaacccca actnngaatg ccnngaaaaa 660 

aaattgcttt atttgggaaa tttgcnaatn ctttgcttta ntttgnaccc antttancnt 720 

cannnnccaa gttacnancn ncaattgcnt tcatttangg ttcaaggttc aaggg 775 



<210> 442 
<211> 804 
<212> DNA 
<213> Homo sapiens 



<400> 442 

gagnnngntt ctatacctgg gnncgatcca aancttncct attaccttgg atcttnngct 60 

atctcnaann aaaangcttn cgaattcggc acgaggccac ctgcactgag gtctgggccc 12 0 

ggggacaggg tgctttagcc aggcttgtct gcgcctcagg gaagggtgag cagcccaggg 180 

accagatgca agttggtggg cccctccacc cntccnacgc cactccccag tgtgctgggt 24 0 

cctaaccagt cgtcctatgg gagcagtcag ccttcctctc ctcctcaggg cagctctccc 300 

acctgctgnt ccccgcacac agaacctcat tgctctgagc agttgcttat tacccagttg 36 0 

ttgaaaaact agcatgtgan ggccgggcgc ggtggctcac gcctgtaatc ccaacgcttt 420 

gggangccaa ngcgggtgga tcatgangtc aggagatcaa gaccatcctg gcttaacacn 480 

gtgaaaccct gtctctacta aaaatataaa aaagtancca ggcgtggtgg tggcccctgt 540 

agtnccaann tacttgggaa gctnangcan gaanaatggc ntgaacccaa gaaggaagaa 600 
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cnttgcantg aancttaaaa ttgcgcccac tggaatttca aaccttgggc cnanaanaat 66 0 

tgaagaatcc cgtcttaaga aaaaaggaaa aaantttncc nttntnaaag gcccggccac 72 0 

aantnggctt taacgccctg gtaaatnccc aancactttt tggggaaggc ccaaaggcaa 780 

ggccnggatt caattttnna aggg 804 

<210> 443 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 443 

gnagccggat cttattattg gcnncgnttt aatgctggct aatntntcgt aatncttggt 6 0 

nnccccaann annnaggngg ggngaattcg gcacgagcac catttttatt ttgatgctta 12 0 

cactcattta ttctgttttt gtaaaacagt ttcaagaatt taaaaatcct tccagttaat 180 

agagcttttg ttattatatt ataattttgt aaacccactt tgtttttccc actttaaagc 240 

cacagggtcg actcatggat gatacctcta ttgctgctgc atgatgttca agaccggccc 300 

ttggctgttg ttacagagat gttgggcaga gctatgcagg tgtttcattg ngaactctag 36 0 

ctttgatcat ggtaaaaagt taaccctttc tattttttaa tggatgttat accaactatt 42 0 

cagaggactc atacttcaaa aatattagga aaatctgtct tatagttctc taataaatat 480 

ctgaaatctc aagtacgaca tgaaagaatg tcagaccatt gntattggtg aaagtcattt 54 0 

gatgaatggn aaattctatg aaaagtaagt ggatttgcat ggattaatat cagggaaaat 600 

ttaagccttc ccaagtgtga ctgggccaaa gagagccaga tgcccccagt gcctgtgccc 66 0 

ataaagttcc cgaatccccc aatggggtct nttttcaaaa acttggncca gacccggaaa 72 0 

ataaaancat tcntcataaa ttcaannggg gncctcanga aacacnttcc cccancaacc 78 0 

cttngg 786 

<210> 444 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 444 

gnagnccggt tcnnangcnt nggctnnatc caatgctggc taaagttcna ananctggca 60 

acnccaggan ncangcgttg cgaattcggc acgaggagga attacaggta gcaaattatg 12 0 

gagttggagg acagtatgaa ccccattttg actttgcacg gaaagatgag ccagatgctt 18 0 

tcaaagagct ggggacagga aatagaattg ctacatggct gtttnatatg agtgatgtgt 24 0 

ctgcaggagg agccactgtt tttcctgaag ttggagctag tgtttggccc aaaaaaggaa 300 

ctgctgtttt ctggtataat ctgttgccag tgggagaagg agattatagt acacggcatg 360 

cagcctgtcc agtgctagtt gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 42 0 

acaagaattc gaagaccttg tacgttgtca gaattggaat gacaaacagg cttccctttt 48 0 

tctcctatng gtgnactctt atgtgctgat atnccatttc ctagtcttaa ctttcaggag 54 0 

tttacaatng ctaacactnc atgatngatt cantcatgaa cctcatccat gttcatctgn 600 

ggcaattgct taccttgggg gntcttttaa aaagtaccac gaaatcatca tattgcatta 660 

aaacccttaa aagttctggt gggnatcaca gaagacaagg ccnaanttna aagnggagga 72 0 

attttattat ttaaaagaac cttttgggtn ggatnaaaan 76 0 

<210> 445 
<211> 761 
<212> DNA 
<213> Homo sapiens 
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<400> 445 

tggtgccggt tcttantctg ngctctcgtc ttccttctta tacctgggca ncncttggcg 60 

gccccnaggn tcccangnag ccnngcngng ncngattcgg cacgagattc caaaggtttc 12 0 

aaagaacttg gtcataaata tgataatgag aagacaaagt atttatatta aaacagttta 180 

gtagccttca gttttgtgaa aatagttttc agcacagaaa ctgacttctt tagacaaagt 24 0 

tttaaccaat gatggtgttt gcttctagga tatacacttt aaaagaactc actgtcccag 300 

tggtggtcat tgatggcctt tagtaaattg gagctgctta atcatattga tatctaattt 360 

cttttaacca caatgaattg tccttaatta ccaacagtga agcactacag gaggcaactg 420 

tggcattgct tccttaacca gctcatggtg tgtgaatgtt ataaaattgt cactcagata 480 

tattttttaa atgtaatgtt atataagatg atcatgtgat gtgtccaaac tatggtgaaa 540 

agtgccagtg gtagtaactg tgtaaagttt ctaattcaca acnttaattc ctttaaaatn 600 

cacanccttc tgcctctgna tttggaagtt gtcagtncaa ctcatcaaag aaaactgcct 660 

aatntnaaaa tcatattntg ggaataattt ccctcttttg tagtctgccc aagatcctta 720 

aagattggat ttttattact atttaaacca gtggattaat n 761 

<210> 446 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 446 

tgggnnngnn ccnaangcng gggannnggt ccccgttcca anactggaan ncttggcann 6 0 

cgaactcgct cnannagnaa ggccgggnga attcggcacg aggccccgct ccatgagcag 12 0 

tgactcccca gctcctcctg gcaccagtcc ccagggctct cctgttggta gttcctgctt 180 

ttcttcttgg aaattcctcg tggacctcga gatctttacc ctaaaatagt tctgttgaat 240 

ttcaccctgg caatgtaaat tgatagctta tcttcacaga tgccagacaa tggacaactc 300 

accatcagtc ctctgctcac ctgagacaaa tgcatgtctg attgcttcct ctgccctatt 360 

ggntatgtga aaatgcagat tcactgagcc agactaaggc atcagtgact ggtcctctac 420 

ctgcctctca catggagatt gggtattcag tgaaaggctg atcaaagacc caaaggaatg 480 

caacagttta tctcttatct acctatgacc tgcganctgc caccaccccc agntggngcg 540 

cctttccaga cagaaccagt gtacatctta cacgtattaa atngatgtcc cnggggctcc 600 

cnaanangna tcaaacaagc ngggcctcga ccaccttggg cacatatccc nanggacatc 66 0 

annctggagg ctngngncac tggcattggc cctnaccctn ggcaaaataa accttctaaa 720 

attggnaaaa aanaaanaan aaaaacctng nnccctntna naacnntacg 77 0 

<210> 447 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<400> 447 

tggtatnntt tnaangctgg nngnnnggcn ccgttccaat gnctggganc nttggcaatc 6 0 

gctctttccg nangatccca tcgattcggt ctgatgcagg agaattgcta aaacccagga 120 

gggagaggtt ncattgagcc gagattgcgc cactgcactc tagcctgggc gacagagcaa 180 

gactccgtct cgaaagaaag aaagagaaag gaaattcccc agggaagtac ctcggcttat 24 0 

ttcataaaca ggtactgaag gaagcagagg catgtggagg acttccccac ctcgtgcagc 3 00 

tatttgggcc gtggcatctg aaatttctta tttcagagtc acccctttga tgaccttggc 360 

agtgaactgc agtcatctgt ttaggccttt ccatggccca cgtcaatgcc ggtatttctg 42 0 

tttgttgcac atttgatttc cttgttgttg gcatttagaa ggccccccgt ttcccagatc 480 

acaccacggg catggaccac agagattgca tcttgtgagt ctgtagaaat ggtcaaggcc 54 0 

ttgtcctctc ttaagtccag agctcangtt aatgcaaaat tttnccggnc atctgtgctg 600 
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aaatcccttt ggggaagctc ctggctggtt tcctgtaggt aggacagcta cacgtnctgc 66 0 

cctttattgg cttcttttca tgaagctcct gccatntacn aaacatgtct cccttcttga 72 0 

atcacatctc tggtattgna actctanaat cgcccgg 757 

<210> 448 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 448 

gggtgnnnng tttctaatgc ttggtngnnc nggnccnacn tttctaatgt tcggaanggc 6 0 

ttggctactc gntctttctn cangnagccc ntcggtncga attcggcacg aggtgtcttc 12 0 

atcttaccca gtggaaccta agaaattaaa ttctccagaa gaaactgctt ttcagacacc 18 0 

aaaatctagc cagatgcctc ggccttcagt gccaccatta gttaaaacat cactgttttc 240 

ttcaaaatta tctacacctg atgttgtgag cccatttggg accccatttg gctctagtgt 300 

aatgaatcgg atggctggaa tttttgatgt aaacacctgc tatgggtcac cgcaaagtcc 36 0 

tcagctaata agaagggggc caagattgtg gacatcagct tctgatcagc aaatgactga 42 0 

attttctaat ccttctccat ctacctctat tagtgctgag ggtaagacaa tgagacaacc 480 

cagtgtgatt tattcatgga ttcagaataa acgtgaacag attaagaatt tcttgtcaaa 54 0 

acgggtgctg ataatgtatt ttttcagtaa gcacccagag gcctncattc aggctgtttt 600 

ttcagatgcc caaatgcata tttgggcatt agaaaggtct gtcgcactta gtagcagcat 66 0 

cattttacag aggatagatt tggagttgtc cagacgacac taccagctat ccttaatact 720 

ttgttgacac ttgcaagang cagtcngaca agtactttaa cttcctcatg 770 

<210> 449 
<211> 792 
<212> DNA 
<213> Homo sapiens 

<400> 449 

ngagaaangt ttctaatgct ggnnnngnna gntcancctt tctaatgttc taatacttgg 6 0 

ganntcgaac tntcncnaca cagnnangcn ntgcgaattc ggcacgaggn cnnctcnatn 12 0 

atnacttgnt cncancggnc tggcatcnac ncgncacacc tacntnagcg cnttgtagcg 18 0 

caatatncac ctnntnaaac ccnnnagtcc cagggctctg ccnnnnnact gntcaactga 24 0 

cnaacnacnn nctancncaa cntnnnntta ngccnctgnc tgnctctatg gcacctnncc 30 0 

tnccntcncn cntnacccnc tacgctcagg gctatataca atgggaacct tnccaacagt 36 0 

aanccntgga tctnaggnat ggcccttgnc tggcggatca cagccttnna gcntatcagn 420 

atcttgagga agacaccatt ccgtcccnga ttntgaccaa ncnctcggat gtgnctatgg 480 

gctcnattga ggnacaacaa ctnncactgc nnataggcca tcctcnnnan nctacacatg 54 0 

ngactttncn nnncatntna aatgnnnana tgtctctcnc aagcatcacc cnctgtccct 600 

ncgncntcnt ggaagacctt ctgnncaact ganctccttc ntgnnncnnn ngattnttnc 66 0 

nnncnnaata tncntncccc aatgnccttg tnnngnattt atnangggnt ttccaatttg 72 0 

ggntaattca ntncccnccg nannctannn ncccatnaac cntcngngcc ttcttgnaac 78 0 

cttttnncct gg 792 

<210> 450 
<211> 848 
<212> DNA 
<213> Homo sapiens 

<400> 450 



114 



BNSDOCID: <WO 9933982A2_I_> 




WO 99/33982 



PCT/US98/27610 



gnatgncccg atttccttaa 
annctgggng ntcgcaactc 
caagttgntc tctgttctag 
gaatcccttt gtcctttttt 
gactgaaaat acagaactca 
agcaaacgga gatcganccg 
gaccctgaga gaaagaactn 
aaatacgntt cctgacactt 
acactggatg aaaaaaanaa 
tncnccanng nnganccnnn 
nnnnnngggn aaacccnnnn 
nnggnnctnn nnnnannnnc 
annnngggnn nnnncnannn 
nnnnnnnnnn ncnngngnnn 
nnnncccc 



tgatggggnn nnnnngagcg anncttccga aanttccaat 
nctcnanaca gnaaggncgn gggctttgct ctctccattc 
aaagcagatg tagtagacat ctactgttgt tgcctgaaca 
tgntaaaagt actcatccct aatattcatt gtnctggaag 
caccatgatc ggccgggaca atcagattat ttcattccnc 
aaaagtggaa anatgagcnc ttctttggng ttggcatatg 
tnattntttc tcttggactg caataaagta tagctgccta 
ggaggnttgt ccacaatcgg ngaaataaag gcgagaccgn 
gnnnccngnn gaanacccac tnnnccannn nccnnnccnn 
tanccgnnan naggccnnng cnntngcnnc nnngccnnnn 
gnnnnncnnn nnnnnnnncn nnnnannnnn nnncnncnng 
cccnncnncc cnncnccnnn nggnaanncc nnnnnnnann 
ccnnnnnnnn cannncnncn cnnnnnggnn nnnnncnnnn 
acnnnngngn nnnnccnnnn nnnnnncngg nnnnncnnnn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
848 



<210> 451 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 451 

gnnnnnnntt tcctaaatgc ttgggnnnnn nnngagngnn nttncnnagt ttcctaanta 60 

gcttnggcna ctcgttctnt ctncangcag nnnntgcgtn gncgaattcg gcacgagcat 12 0 

tcctcctttg ttaacgaagc aacatttaca caagatggac attacattat tagtgcatcc 180 

tctgatggca ctgtaaagat ctggaatatg aagaccacag aatgttcaaa tacctttaaa 240 

tccctgggca gcaccgcagg gacagatatt accgtcaaca gtgtgattct acttcctaaa 300 

aaccctgagc actttgtggt gtgcaacaga tcaaacacgg tggtcatcat gaacatgcag 360 

gggcagattg cagaagcttc agttctggta aaagagaagg tggggacttt gtttgctgtg 420 

ccctctctcc cgtggtgaat ggatctactg tgtaggggag gactttgtgc tctactggtt 480 

cagtcagtca ctggcaaact ggagagaact ttgacagtgc acganaagga tgtgattggt 540 

attgcacatc accctcatca gaacctgatt gctcctacag tgaagatgga ctcctaaagc 6 00 

tctggaaacc ataattcaac ttttcttttt taaatcaact cgaaagcatg tncttaaatg 660 

aacatattca tgtaangggc tttttttttt tgncactttt ctaagcaaat agatggctga 720 

attagtcacn gaataaattt gngaaaatca tggttaaatn ccaac 765 

<210> 452 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 452 

nnngnnnnnn ntttcctaaa tgcttggggn nnnnnngngn nnnnnttctn atgttcctan 60 

ngcnnnggng ctcgttctnt ctncacgnng ccngtgcggt gncggtctga ttgaaagctg 12 0 

ttcaggttta tcatgcaaat cctcgcctct ggctacggct ggctgaatgc tgcattgctg 180 

ccaataaggg gacttctgaa caagaaacta aaggccttcc cagcaaaaaa ggaattgtnc 240 

agtctattgt tggtcaagct atcatcgtaa aatagttttg gcatcacagt ctatacagaa 300 

tactgtttat aatgatgggc agtcttcggc cattcctgta ccagtatgga gtttgcagcc 360 

atatgtctca gaaatgcctt gttgctgctc ctgaagaaca gcaagatcca aagcaggaaa 420 

atggggctaa aaatagtaat caattaggtg ggaacacaga gagcacgaaa gcagtgaaac 4 80 

ttgcagcagt naaagccatg atggagatna attcattcca gcttcacctt cttctccatt 54 0 
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gagaaaacag gaattagaaa acttaaagtg ctccatactt gcttgcagtg cctacgtggc 600 

tctggctttt gggtgatacc tcatggcttt gaatcatgcn gatnaacttc ttcagcagcc 66 0 

caagctgcag gatctcttaa gttttgggac atttatatgc tgcagaaccc ttatcttctt 72 0 

cgacngaatn tctgtgccnt tctcacttga ccccgagaat gtnct 76 5 

<210> 453 
<211> 833 
<212> DNA 
<213> Homo sapiens 



<400> 453 

gngtnnnnnt ttctaangtt cntaatannn tnggctactc gttctttctc caggnatccc 6 0 

ntgcgattcg aattcggcac gagagaaacg ttctcaggtt gaccagctgc tgaatatttc 12 0 

tttaagggag gaagaactta gtaagtcatt gcagtgcatg gataacaatc ttctgcaagc 18 0 

ccgtgcagcc cttcagacag cttatgtgga agtcaagagg ctacttatgc tcaagcagca 24 0 

gatactatgg agatgaatgc actgaggacc catagaatac agattctaca ggggattaca 30 0 

agaaacatat gaaccttctt gagcacccca ggttttggca ttagaaaatg ggtacccctt 36 0 

ggttcaaaaa tgaacaaaga aagccttaga tttggatggg ggaacctgat ctgtccagtc 42 0 

tagaaggatt ccagtgggga aagtgtttcc atttccttng tcccctggct tggccaagga 48 0 

aagcgaaagc cctttcttga anagcaaccg tggatcattn gaccaggaac ttccttctgg 540 

ggtattaagc ttctttcaan tggaagggaa aggttccang gccaaaggaa aaatggaagc 600 

ccccaaccng atgggtttca ccctaantaa cctcaattgg aagggcttgg accaagaacc 66 0 

cnggaaaggc nanccattgc acccttaaaa ncaaggaaag tggaccacct ttggggcttg 72 0 

ncnttccntt ccgaaccagg ttgaaaangg gcttgaaaaa tggttgctta cccaaaaggg 780 

cgnacnttaa tggcaccaat tattcctntg gaccnttttt aatanccttt ngn 83 3 



<210> 454 
<211> 737 
<212> DNA 
<213> Homo sapiens 



<400> 454 

gnggnnnntt ctaagttcct aatnctgggc tactngttct ttctgcaggn atcccatcga 6 0 

ttcggcaaaa tcaatgtgga ctgaacataa atcacctgat ggaaggactt actactacaa 12 0 

cactgaaacc aaacagtcta cctgggagaa accagatgat cttaaaacac ctgctgagca 18 0 

actcttatct aaatgcccct ggaaggaatn caaatcagat tctggaagcc ttactattat 24 0 

aattctcaaa acaaaagaat ctcgcttggg ccaacctaaa gaacttgagg atcttgaagc 300 

aatgatcaaa gcttgaagaa agcagtaagc aagaagagtg caccacaaca tcaacagccc 36 0 

cagtccctac aacagaaatt ccgaccacaa tgagcaccat ggctgctgcc cgaagcagca 42 0 

gctgctgttg ttgcagcagc agcagcggca gcagcagcag cagctgcagc caatgctaat 48 0 

gcttccactt ctgcttctaa tactgtcagt ggaactgttc cagttgttcc tgacctgaag 540 

ttacttccat tggtgctact gntgtagata atgagaatac agtaactatt tcaactgagg 600 

aacaagcaca acttactagt acccctgcta ttcaggatca aagtgtggaa agtatncagt 66 0 

aatctggaga agaaacatnt taaccaggaa actgtanctg attttacttc caaaaaagaa 72 0 

gaagaggaga gccacct 73 7 



<210> 455 
<211> 718 
<212> DNA 
<213> Homo sapiens 
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<400> 455 



ggnnnnnntt tnnncngttn 
ntgcgatncg aattcggcac 
ataatagtga tggaaagaca 
caaattcttc ctcaggacag 
aagagtcatg tgttttaaaa 
agagatttta aaaatgcagg 
gcccttaatg aatttagaaa 
ttcagactcc tctagttggc 
gacgattgcc atatgaactt 
aggaaggaaa ttttggttat 
agtgtncaga ggatagagnc 
gttatgtctc caaaagtgag 



cntaaaannc tgggctactc gttctttctn cangnagccc 
gaggatgagg agtgtttaat cattgataca gaatgtaaaa 
gctgttgtgg gttctaactt aagttccaga ccagctagtc 
gcttctgtag gaaaccagac taatactgct tgtagtcctg 
aaacctatca aacgagtata taaaaaattg atccagttgg 
atgagctctt aaagccaatt tccagaaaag taccagaatt 
attctaaaca gccttctgtt tctgagcaat tgtctggtcc 
ccgaaatctg gatggccttc tgcatttcag aagccaaaag 
caggactatg ttgaagatac atcggaatac ctagctcctc 
aagttattta gcctgcaaga cctgttgttc tcgtcgctgc 
agaccacgtt ctaaaacnga gaaatcagaa gacatttnca 
tntcagctgt atgagttgac tctgctgaaa gtgacttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
718 



<210> 456 
<211> 739 
<212> DNA 

<213> Homo sapiens 
<400> 456 

gtggnnnntt ctnngtttcc aatangntgg gtctcgttct ttctnnacga tcnnntgcga 6 0 

ttcgcttggg aggctgagtc aggagaaatt gcttgagccc aggagatgga ggttgcagtg 12 0 

agccaagatc atgccactgc actccagact gggcaacaga gggagactcc gtctcaaaaa 18 0 

ctaaaaaaaa aaatncattt agtataccgg ggggtggggg ggagaaataa tgttatttcc 24 0 

tatgcgaaat gacgtgtatc cctgtaccca tgggtaaatg taaatatact gtgtctcttt 300 

tgggagagcc ttttagtaga ggagtcttat atgaagtctc tcataagtag ttcacttgag 36 0 

ttttgcagtt tgaaatctta aaggagcttt aattgacatt tattatacca attaagcttg 420 

gaatggggca atggatgcat ttccaaaacg tgtgaaagcc taacagctta tattgctgaa 480 

tgagaatctc ctgggtgtaa tttancactt agggaactgc gtgaacactc ccagccatta 54 0 

tgatgctggt accagcttta ntgtntaaat gccatganta ttctttctgn tctgttttgt 600 

gctctcttgg tncatttatt ttacccttta cngaataatt tcttgtaaaa tccntaaaaa 660 

tntttggcat ttaaaagtcc nntcttggan tnaananann nnnaanaaaa ancttncccc 720 

tttanaactt tngngggct 739 

<2X0> 457 
<211> 743 
<212> DNA 

< 2 1 3 > Homo sap iens 
<400> 457 

gtgnnntnnt tctnnngttt ccaattantc tgggngctcg ttctttctcn anncnnnnan 6 0 

tggttgncga attcggcacg aggnnanagg gnagctacat gnntnaccnt nttngnnctc 12 0 

tcagccangc tcnnctnnnn ctggtcctac tgctacatag aacacttgtt ntncnnggna 180 

actnntntat gtnnccnnga ntctctgnna ctngtttaaa tgctanttga taacaggcta 240 

tgcaaggnct gnaagtggan agcgtcatca ttcatcatnc ntnttanctn gantnnntgt 3 00 

atcctacatg ctttgattgg taaatgngcn tcagactggg actctcaata aatgnatata 360 

gangancttg ctgtggaaan ctgtcctctc ntatctntnc atgngnaant tccactncag 420 

tntgaactcc aaatgcnntn atnggnganc cctncttgta tagtggtgtc cattccaanc 480 

tgcnagggnc tagaaaccgt cggctntngg aaacnatggt gnnagttgan ctggtacang 54 0 

cngttntcac ctgcanctac cataaaatgg gnttacccaa gctttatcat ggaatggnta 600 

taaaaaacgc attnattgng cctttntaan cccattatnt gttnaatttn acttatggtt 660 
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ccccccattn aaattatnca attgggnann gangcttcna gtcnccatnt ttnaatggnn 72 0 

ttnncaaaaa aacgnttttt ttt 743 

<210> 458 
<211> 906 
<212> DNA 
<213> Homo sapiens 



<400> 458 

gnngnnnnnn nttnctaatg cttgggnncn cgtttctann nnnnnnnnaa nntttcctaa 60 

ttggttaggn gctcgnnctn tctccacnna gnnnngcggt gcgaattcgg cacgaggctg 120 

aatcaaggat cacaaactnc acatttngca cnttggtctn cacatncntg gttngggcag 180 

tcncagtnaa catggctntg gaaactnatn ttngnctngc ntcaaccatc tcgttcccng 240 

gggacccann ntccnnnatc ncgnnttncc tcgnnnatng gagngctnct tngnccannn 300 

atgggctccc nanaatangn ntnncnnngn nnatncancn ncngncaann ggntcnncnt 360 

nnnnnngccc tnntncctna tggnnngctn catgncccat nnnnnnggnn ancaataann 42 0 

naaanggtct nttcccncga nncccnnnnn ccnctaacan ngnacctcgc aaagggcccc 480 

aggcnttnnc tngnaaacca nnttngccaa nggtanttca aaggnngcct tngggacctc 540 

ccnannnngc cntggnnnta ccccgggnaa anggtngnaa acccnnccnn ngntgccnnn 6 00 

cccggncnng gaaanaaatt tccnnggnac ccagnntncc nccgnaannn anantannnc 660 

ccancccnaa cnttnngccc ncanccnttn gnnnntgnan tcnncnnncc ctttnnnntn 720 

nccaanncgg ccnggnnacn nncttnnacc tntttncncn naanngacnc caanntcctn 780 

nannaaaggg nggnnnnnnn nnncnttncc nnnggnagcc cnnnnnncct nncntnncnn 84 0 

aaaaattcnn cnntgnancn cccctnnnnt nangngnccc natnnnnnnn nnngnaaanc 900 

nnaccc 906 



<210> 459 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<400> 459 

gnngnnnngg nttcctaang ctggggnccg ntctnnnnnn nnnnnnnngt tcctaaanac 60 

ctaggngctc gntctngctc cacgcagncn gggcgtgggc gaattcggca cgagcttctg 12 0 

ttgattggtt tgtttaaagt acctaagtac taccctttga ctccctacca aaagttcttt 180 

tgttttttaa acaactttta tttgtgactt actttcttga gaagngttct taatgaattg 240 

cataaaatag tggtagcagc ttatttctta agtncttnat tattggggct ttaccattca 300 

ggtcttatct ttaaccctta tttactcagt tttccatctg aatgatccta tctctaaatn 360 

aaggatttaa taaatgctgc aaattgtcca ctttgcaaat ngtccaaaag ctttagtttt 420 

ggaccttgng aacttttttt ttaataacac attatttggg cccggtcgtg gtggctcaag 480 

cctgtaatcg cagcactttg gaatgcctag gcagacagat cacttaangc ctggagttcg 540 

agaccagcct ggccaatgtg gtgaacjacct ccggttctat tactaaaaat nctaaaaaat 600 

tancaaggca tggnggtgca cgcctgnaat ctcagctact tgagangcaa atcnggagaa 660 

atgcttgacc ngggangcan anatganccn anattgcacc actgcattcc acctggggan 72 0 

nanantgaga anctggctca aaacccaaaa acccaaaaaa aaaan 76 5 



<210> 460 
<211> 677 
<212> DNA 
<213> Homo sapiens 
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<400> 460 



gtttncgctg ggagccacca 
cctgaaacct ggctgatggg 
cagaccacag cactgatttc 
gttttgtggt taaactccct 
ggttatcaac ctctacatta 
aaacggttcc tatccaatcg 
gtggctgttt ccgttgctcc 
gaacttctaa ctgaccactc 
caccactttc atccctcctg 
tcagtgagcc ttatagagtt 
tatgtagctn atgcttattg 
gatgagaata tagcccn 



acatagcaga ttaccatgtg aagttgccac tgctgcatct 
agaggtctca ttttgtgtct gagaatgtcc aggttgtctg 
ccattagcag ttattatttc ctggccattt cttcctgaag 
gtcctcaata ttttatcagc agtagggctg tcattcttct 
tgaagtaagg ttcaaccctt ctgcttttct caggccccca 
aacacaaaaa cgggtattga gaaggaattg gcagggctca 
tacctcatgg agactcttac tcatgctgga tttattgaga 
acccccaccc actcttatgc agtctgttca ttcctgaaaa 
cacacaaccc atgagggatt gctacttcct ataagattcc 
gctgcgagaa ttacatttgg tcatgatgtc aagtgtctgg 
aacacatagt aatttattgg aataattgnc atgatcactg 



300 
360 
420 
480 
540 
600 
660 
677 



240 



180 



60 
120 



<210> 461 
<211> 787 
<212> DNA 

<213> Homo sapiens 
<400> 461 

gnnnnnnnag ggnnnngngg ggcctcncaa agcccgngcn acaggtcccc gttccaaagc 6 0 

ntgggnganc gcnncgcccc ancagnaagg cgggggaang cggcacgagg acatcatcnn 12 0 

cttattctag taagagaaag tacacagatt caactttaga gaggacnggg gggnnnncng 18 0 

gagcnaaatc aaggaaggan tatcacgngg ccncccnnga atataannnn gaagctgnga 24 0 

acagnaccat cagnaacann nnatggacag ctctgatggg gnnnatacca cggcactctn 3 00 

cnnaccnnng gnggaagcna tccggagnna tgactgangn gnaaagnggn nnactgnnag 360 

aanccnggng ngctaggann ctgggagagn cactttcang aagnnaccng gcgangagnc 42 0 

atcanaagaa cccgganaag ngagaagacn ggaaaaagnn cncancgnac ngagcccagn 480 

nannnncnct gagccanggg ctncgaaang ccccaccnga agcnccatca canggnacaa 54 0 

ggnnngggaa aaggaancna cnnngcngac angnccnccn aanagngcca aancacngcn 600 

nngcccncnc gcccaaagaa nacnggacng cnggcncnna ncanaaggag cncnanggcc 66 0 

cnnggnaang aaactncnag nagcccaanc ccaaaggccc cnanggannn ccnncaaggg 720 

gaaaacanna nncacccaag gggcctgggc naanaaggcn ncccacncng gcccnccnnc 78 0 

nnnaccg 78 7 

<210> 462 
<211> 747 
<212> DNA 
<213> Homo sapiens 

<400> 462 

ctaatggctt ggnnnnnnng nnnnccgntt cttaattgnc ttgggcnnct cgctctntct 6 0 

ccannnagnn nntgcgttng cgaattcggc acgagcctca gccccacacc agctctattt 12 0 

caggggtgag agtcagagag cactgcaata tgtgcttcat gggatttcga ttcgaagatc 18 0 

ctagaccagg gagacactgt gagccaggga tacaacaaaa tactaggtaa gtcactgcag 24 0 

accgacctcc ctgcagtttg ggaaagaagc tgggtttgtg gagaatcaga gcatcttgac 300 

atgactgctg acctaaagat ccctggcatt ggccagggat cctgtggaac ctcttctagt 36 0 

tcaggggtgt gagcattaga ctgccagttg tctagtgaca tctgatgctt gctgtgaact 420 

tttaagatcc ccgaatcctg agcacctcaa tctttaattg ccctgtattc cgaagggtaa 480 

tataatttat ctggatggaa attttaaaga tgaatccccc ttttttcttt tctnctctct 540 

tttctttcct tctccctttc ttctttgcct tctaaatata ctgaaatgat ttanatatgt 600 
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gtcaccaatt aatgatcttt tattcaatct aagaaatggn ttaagttttt ctctttagct 660 
ctatggcatt tcactcaagt gggacagggg aaaaagtaan tgccatnggc tccaaagaat 72 0 

tnntttatgt tttagctatt taaaaaa 747 

<210> 463 
<211> 750 
<212> DNA 
<213> Homo sapiens 



<400> 463 

tncctttcta angcnnntng nnaanngtcn ccgttctaan tncttgggca gnncgctctn 60 

tctncannca gncnntgcgt tgcgaattcg gcacgaggcg agatgaagct acactgtgag 12 0 

gtggaggtga tcagccggca cttgcccgcc ttggggctta agaaccgggg caagggcgtc 180 

cgagccgtgt tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc 240 

ctgctcatct ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt 3 00 

gagcaattct tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct 360 

cctgtggata tctgtctaag taaggattcc atatggctct catatcattc cattccatct 420 

ctgccaagat ttggataccg caaaaatttg tgttngngga agattctgnc tgaactcttt 4 80 

cattcaagga actactacca tgaatctgca ttctgntgcc cacactgagg ncttagtaga 540 

taattgggtg gtctgaaaca cctattatct cttatntctg gtctctangc tggnatgtta 6 00 

attcctctga aatgntaaaa gtaatgggtg anaccngaaa aagaaatttc aatnacagat 6 60 

caanntgggg ngcatgtatn attttcaagc gtcaaaatgg aataagggaa gantnctgga 720 

tacctgcttg gaaaaggaag natgtgtatn 750 



<210> 464 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 464 

gnngtgtctt tgnaaagcct ttggggaann gncnccttct aatgcttggc tatcgntctt 60 

tacgcagnnc ccatcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120 

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180 

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctcc gtcgtggatt tcgcggcgat 24 0 

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 3 00 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360 

tgggatcggc ccaaggccag ttctccgcag gttggntgct tctttcgttc tctcctctgg 420 

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 4 80 

nccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcaaatgg agacgcagtg gtggccagag ggtgatggag 6 00 

aagacgggaa aagcgacagc cacgctnctg gcttgaagcc gcaggacgca aataacttac 6 60 

tttggacctg acagttctac gttgntgtgg angccctgtt tcctggaaat aaaactcaaa 720 

atggtggttt tttggaaaaa aaaaaaat 74 8 



<210> 465 
<211> 863 
<212> DNA 
<213> Homo sapiens 

<400> 465 

gggnnnnnnn aanggnnnnn ggnnnnngtc ccgttccaan gaccnngaga tcgnngncgc 60 
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tccanaagaa aggcggtgng aattcggcac gagacctgta ccgcctggcc actggctgtc 12 0 

accggcgtga tgagctgccg gtgtttgaac gcngcctatg cngggacttt cccggcanan 180 

nggcnngaan atggccncca tncaggaagc cgcccagaac ctcctnggnn acacnacttn 24 0 

agngccttcn agtccgntgg nacccggncc aagccccggc aancnctgcc ccgggtcncc 300 

gttcccaagg ccaaccagcc ctgggnaccc ccggggagcc gaaacnctgg ggctnggana 360 

ccgngantga gagncncact tttcnntgta nacacgggcc cagganacan ctntgctcgt 42 0 

ggccccgggg naaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 54 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 66 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 72 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 84 0 

nnnnnnnnnn nnnnnnnnnn ncc 863 

<210> 466 
<211> 713 
<212> DNA 
<213> Homo sapiens 

<400> 466 

ngtctttcga gcntggngnt cgttctngct cnannanatt ggttgnggga attcggcacg 6 0 

agcctcagcc ccacaccagc tctatttcag gggtgagagt cagagagcac tgcaatatgt 12 0 

gcntcatggg atttcgattc gaagatccta gaccagggag acactgtgag ccagggatac 180 

aacaaaatac taggtaagtc actgcagacc gacctccctg cagtttggga aagaagctgg 240 

gtttgtggag aatcagagca tcttgacatg actgctgacc taaagatccc tggcattggc 300 

cagggatcct gtggaacctc ttctagttca ggggtgtgag cattagactg ccagttgtct 360 

agtgacatct gatgcttgct gtgaactttt aagatccccg aatcctgagc acctcaatct 42 0 

ttaattgccc tgtattccga agggtaatat aatttatctg gatggaaatt ttaaagatga 480 

atcccccttt tttcttttct tctctctttt ctttccttct ccctttcttc tttgccttct 540 

aaatatactg aaatgattta gatatgtgtc aacaattaat gatcttttat caatctaaga 600 

aaatggttta attttttctc tttactctat ggcanttcac tcaantggac aggggaaaaa 660 

agtaattgcc atgggcttcc aaaagaattg ntttatgntt tagctatttn aaa 713 

<210> 467 
<211> 732 
<212> DNA 
<213> Homo sapiens 

<400> 467 

gnnnggtnnt ctaatncttg nnnnnnnntc ncccttctaa gccntggnct cgnctnnccn 60 

acnancnggc ttncgaattc ggcacgaggc gagatgaact acactgtgag gtggaggtga 12 0 

tcagccggca cttgcccgcc ttggggctta ngaaccgggg caagggcgtc cgagccgtgt 180 

tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc ctgctcatct 240 

ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt gagcaattct 3 00 

tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct cctgtggata 36 0 

tctgtctaag taaggattcc atatggctct catatcattc cattccatct ctgccaagat 420 

ttggataccg caaaaatttg tgtttgtgga agattctgtc tgaactcttt cattcaagga 480 

actactacca tgaatctgca ttctgntgcc cacactgtgg tcttagtaga taatttgggt 540 

ggtctgaagc acctattatc tcttatttct ggtctctagg ctggtatgtt aatcctctga 600 

tatgttaaaa gtaatgggtg agaccngaaa aagaaatttc aatacngatc aantttgggg 660 
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tgcatgttga atttgcaacc tcaaattgga gtaagggaan attctggata cttgctggaa 72 0 

aggaggaatg tn 732 

<210> 468 
<211> 748 
<212> DNA 

<213> Homo sapiens 



<400> 468 

gnnagnnttc taatngcttg trumnnnnna gacgttctaa nnctttggcn atcgttnttt 6 0 

ctncagnann ccntcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120 

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180 

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctnc gtcgtggatt tcgcggcgat 240 

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 300 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360 

tgggatcggc ccaaggccag ttctccgcag gttggttgct tctttcgttc tctcctctgg 420 

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 4 80 

tccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcagatgg agacgcantg gtggccagag ggtgatggag 6 00 

aagacgggaa aaagcgacag ccaagctcct ggctgaaacc gcaaggacgc aaaataactt 660 

actttgnacc tgacagtttc tnacgtttgt tgtggangcc ctgtttcctg ggaaataaac 720 

tcaaattggt ggtttcttgg aaaaaaaa 748 



<210> 469 

<211> 776 

<212> DNA 

<213> Homo sapiens 



<400> 469 

gggngntcta atgcttgnnn tgattctccg tctataacng gntaatnctt ggncctacna 6 0 

aaaggctang ngaattcggc acgagacctg taccgcctgg ccactggctg tcaccggcgt 120 

gatgagctgc cggtgtttga acgcaaccta tgctggactc tcccggcaga ctgcctggat 180 

atggtcgcca tgcaggaagc cgcccagcac ctcctcggca cacacgactt cagcgccttc 240 

cantccgctg gcagcccggt gccgagcccc gtgcgaacgc tgcgccgggt ctccgtttcc 3 00 

ccaggccaag ccagcccctt ggtcaccccc gaggagagca ggaagctgcg gttctggaac 360 

ctggagtttg agagccagtc tttcctgtat agacaggtac ngaggatgac ngctgtgctg 420 

gtggccgtgg ggcttnaann tnannnnnnn nnnccnnnac caantctncn nannannnnn 4 80 

ccnacnnnta aaantnncnn ncnnnnncan nnnnnnnnnc cnnnnanncc nnnncttnnn 54 0 

naancnnnnn rmnnnnnanc nnnncancna nnnnnnnnna nnncnnncnn nnnnnnnncn 6 00 

nnnnnnnnnn nannccnnnn nnnnantnnn nnannnnnnn nnnnnnnnnn aaannnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncncnnnnn nnnnnnncnn nnnnnnnnnn 720 

ccnnnnnnnn nnnnnccnnn nnnnnntnnn nnnnnnnntn nnnnnnnnnn nnnnnc 7 76 



<210> 470 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 470 

tatgnntttn ctaaaatncn tgggcaanac gtcctcnctt tcctaanagn ttnggcanaa 60 
cccttggcaa nacgccngtn acccanacnc agnnnggccg tggcgggcga gcgggcaaca 120 
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gctcttgagg 
tgctgagttc 
gaaggcctgg 
ggaatctgta 
tgctgccctg 
ttntcagatg 
aaattcgttc 
aactgangag 
ccagtcaaac 
tccatgctga 
acctncgaca 



agtgagactg 
atcaagagga 
gattttttgt 
gttcagcact 
ttagacatca 
agtaaaggac 
aagaaaattc 
aatgcannct 
ctacctacgn 
ggcgcaatca 
agaaggggan 



cnggagatnt gggccgtgcc 
ccatcttgaa aatccccatg 
ctgaaaatca actgcagact 
tgatccatct gtgtgaggaa 
tttatatgca atttcatcag 
caggtgaaga tgttgacctt 
ttcanagagc attaaaaaat 
ggattccaat tgcccgggga 
gggggactac tccagactcc 
ccgcttctgg gncaagaagt 
catttanatt tacccnaaat 



aaagagatgg atgagactgg 
aatgaactga caacaatcct 
gtaaatttcc gacagagaaa 
aagcgtgcaa gtatcagtga 
caccagaaag tttgggatgt 
tttgatatga aacantttaa 
gtgacagtca gcttcagaga 
acacagtaca caaagcccaa 
cgnacncctt cacgtcctcc 
tanaaacnct gggaaaaact 
gaana 



180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
765 



<210> 471 
<211> 820 
<212> DNA 

<213> Homo sapiens 
<400> 471 

cnrmringggn nnngnggcgn cntccnaaan ccggggcgac agngccnnng ttccaacaga 6 0 

ccnggngngc cgncngngcc ccanacagca ngggnggggc nnnggggnnn cnncgncnnn 120 

cnnancnaca aagaactcaa caagaaaaaa acnaacccca caagcgggca aaggacgnga 18 0 

acagacantn cccaaaagaa gacatacaag caaccnaaaa taatcnaaaa taagnnncaa 24 0 

aaagaaaaaa ngcnagacag agnngngana gnactnagna aaaagngana tctagcggcn 3 00 

annagnangn nngnnnacgg ncngnnncna agaaanagnc nctggnnccc aagcnggagn 360 

acagcggcgc aagcnnggcn cactgcaacc gcgaacnccc gggctcaagc gaaccnccag 42 0 

cctcagcctc ccaagnagcn gnnaaaggca ngcaccacca cacccgacna aaatancngc 480 

nancaanaac ananaanggc nccccngngc nnanncagga aanaaacacn cnnangcnnc 540 

ngaaaanaan naancncncn cnnnacaaaa aaacnnnagc cnnagaacaa nnnnggaggc 6 00 

ggaanacggn nnancccgac anganaanga nacnanngan gganganngg gaccaaaccn 66 0 

cancccggga anggcnnngn aaaaaaaang ccnnnaaann gggggaaaan ncggngnang 720 

ccnaaagggc cnnaaanggg gaaacccnan naaaangccg ggcanannan aaccnagcnn 78 0 

nancnanccn nccaangggg nannnccncn nncnaggccg 82 0 

<210> 472 
<211> 738 
<212> DNA 

<2 13 > Homo sapiens 
<400> 472 

gnngtgtctc taatgcttgg ctactngttc tttccgccga acncttgcta atgcttggcn 60 

ntcgttcttt ctccacnnac nnngcnntnc gaattcggca cgaggtcaca ganatnaaag 12 0 

tccaatcata ggggctggnc cnacntctnt gctnntccct gcangantca tangatcagn 180 

nanaccgtgc gnntttgnaa gcntttcaaa tgtgntacca tcgngttact tncnnnggca 24 0 

cctgntgann tnggttgnac tnnncnggat nctccaaanc caccnnnncn atgggntnng 3 00 

tgngcatgng ntggnncann nacagannna ganactttaa ngaannngnt tntgcaaccn 36 0 

tnggnnctag caancntgan antnccaggg nnggccacna agctgaaaat nnatgttana 42 0 

ncnnatgntg naatctctag natgacttcc ncannnancn aaactnangc anggctgcna 48 0 

tgttagaanc tanaggccna atttcttntc natgnaacca ntntatgctt ttaagaccnt 54 0 

caactgtnnc natgaagccc atntacatna ttncggtaat anggctatnc ttaaannnaa 6 00 

ctgctgaaaa tnatgatnca nctacgaaat cctnncancc ncatntggct naatcattac 660 

caaccatttg acaccnncat ngnctaccca cntgcattnc catgaccnan tccantgcca 72 0 
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cccgcncaga tntacctt 73 8 

<210> 473 
<211> 752 
<212> DNA 
<213> Homo sapiens 



<400> 473 

tatgnntncc taanagagtt ntgnnacacg gcccgccttc tnaaancttc ctaatncttg 6 0 

ggcgctcgtt ctntctncac ncagnrmntg cggtncgaat tcggcacgag gtccttttga 12 0 

accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga ctgttctact 180 

ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag gcccggggcg 24 0 

accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc gaatttcgac 300 

ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc caacccgaga 36 0 

tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc agccaggtgt 42 0 

gtgccacacg gcagccttcc cgaaacatag tatggatttt aaaaatgtgt ntatttttgg 48 0 

ttctcaacca ctttataacg tattttttaa tttattttgt aatgtcttgt tttgaagtat 540 

tgctgctatc cttggtatcc ttcccactgg ttttatcact ganttatttt gngaaagttg 600 

ncactaatgt tctatgtcaa aatcaaaagt atttaatgaa atactanntc tatttaatgt 66 0 

ggntatggaa ccagctggaa acacaaaaca aacagtgatt gacancaagc tgggcccaag 72 0 

agncaggtca ttttgnacat atgccaataa ac 752 



<210> 474 
<211> 752 
<212> DNA 
<213> Homo sapiens 



<400> 474 

ttgcanacnn aatanttgct gtaaaagtcc cnnctttttn ccctttctaa tgnttgngcg 60 

ctcgntctnt ctccacnagn nnntgcgttn cgaattcggn tctnagccca tgccgggatc 12 0 

ttcccacacc cgtcctcaca gatccagccc cagcccctgt cttcccaggc catctctcag 180 

cagcacctgc aggatgcggg cacccgggag tggagccctc agaacgcatc catgtcggag 24 0 

tctctctcca tcccagcttc cctgaacgac gcggctttgg ctcagatgaa cagtgaggtg 300 

cagctcctga ctgaaaaggc cctgatggag cttgggggtg ggaagccgct tccgcacccc 360 

cgggcgtggt tcgtctcctt ggatggcagg tccaacgctc acgttagaca ttcatacatt 42 0 

gatctccaaa gagctggaag gaacggaagt aatgatgcca gtttggactc tggcgtagat 48 0 

atgaatgaac caaaatcanc ccggaaggga angggagatg ctttgtctct gcagcagaac 54 0 

tacccgnccg tccaagagca ccancagaaa gancctcanc cccagacagc acggnctaca 60 0 

cgcanctcgt gnacctggat gacntggaac anaatggtan cnaatgtggg accacngnct 660 

tgtancccna ggacaaggcc ctncnangct tgntggangg g tenant cng anaaatggng 72 0 

gccactgccc aacccgcang aaganaacaa nn 752 



<210> 475 
<211> 742 
<212> DNA 
<213> Homo sapiens 



<400> 475 

gntttctntt aatncttttn naaangcgnn ntttaccntt ctangnntgn gnetegttet 6 0 

ttcccacnna nnnnncggtn egaattegge ncgaggtgaa acagaaagtg gagatgettt 120 

ccttgacctg aagaagcetc ctgcctccaa atgcccccat cgctatacaa aagaagaact 180 
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cttggatata 
gacagtgatg 
cggagctcac 
cgcaggatag 
cctcagagac 
tcangaagtc 
ggcagtggga 
acctgcggga 
anaanttttg 
cnnaanaaga 



aaagaactcc 
gtgtctggga 
cagtggaaag 
tagatccacg 
ngagctttgg 
cattagagaa 
ggataatctc 
cttgagagac 
gagaaagtaa 
acccgaatgg 



cccattccaa acagagcctt catgcctttc tgaaaaatat 
ccctgagaag tggcatgcct ctctctaccc agcttcaggg 
tctgaagaaa gagttggata cagaccggcc ttccctggtg 
agagcgtgtg aaagaagatg acttanatgt tgttctcagc 
agggggctgc cacgtgacag ccgctgtcag ctcccggcgc 
agatagtgat gggcttcgtc tgcttggtgg acgtaggatt 
tgcccggacc tttgagaagg atcaccgctt aacgataagg 
agagaccnan anaaggactt caaggacaac gtttcangan 
ncntgtcttt tggtgancgt anaaanaaat gattcttacn 
tt 



240 
300 
360 
420 
480 
540 
600 
660 
720 
742 



<210> 476 

<211> 1122 

<212> DNA 

<213> Homo sapiens 



<400> 476 












gnnngggnnn 


ttctaaaagc 


tgggnnnnnn 


nnngaggnnc 


ttctaatnct 


tctaatggtt 


60 


ggctctcgtt 


ctttctncac 


gcagcnnngc 


gnnncgaatt 


cggcacgagc 


ctgcagccac 


120 


taatgcattg 


tgtatgataa 


caaaaactct 


ggtatgacac 


attttctgng 


atcattgnta 


180 


attagtgaca 


tagtaacatc 


tgtagcagct 


ggttagtaaa 


cctcatgtgg 


gggtggggtg 


240 


ggggtgtatn 


cctngnggga 


nggnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


300 


nnnnnnnnnn 


annnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


360 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


420 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


480 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


540 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


600 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


660 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


720 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


780 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


840 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


900 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


960 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


1020 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


1080 


nnnnnnnnnn 


annnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nc 




1-122 



<210> 477 
<211> 747 
<212> DNA 
<213> Homo sapiens 



<400> 477 

gnngtgcctt tgaaannccn tttgnnnnng nggcccttct aatgctnttn cgntcgnggg 6 0 

gtcgaactcg ccccacncng cnaggcgggg gctncaagcg attctaaacc acctatgagt 120 

atttctttta gggctcactt aaatacatgt ntgngnntac tgggggctag ccngaataat 180 

tttagatctg atcaggtngn ngctnaaatt ngaaaaanac cnnntngatg cttaaagaat 240 

tngcntccat ttttgagtct aaatctttta aaatntactg ngatccacat ctagngaaat 3 00 

gtcngtgtca anatattctn gatnatcgct naaatccnca ttaatactcn ttnggggtnn 360 

nnnatagngg aacttcntag nnntncnaaa agcacatngn cttcctgnct ccgctgctcc 420 

cacagnnggt nttgnaactg ggnaaatcag nnnnnngata gcgngngnnt ntnaganaaa 480 



125 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 



PCT/US98/27610 



ntngatncac acatncttnn nnctcagncn 
ctgnggngga tgangttgga gttcgannga 
ccnngncttt cccntnccct cnctngaaag 
tganngnncn gtcnngcccn cttcngncnc 
cnnngnnntn cgngnaaccg nnccngc 

<210> 478 
<211> 746 
<212> DNA 
<213> Homo sapiens 



<400> 478 

gnnnnnngcg cgnccttcta atgcttcnta attnnctngg atactcgttc tttctncagg 60 

natcnnntgc gnttcgcaag gagnagagtg atagnaattg gcagtgaaat atacgaacca 12 0 

ccctcctgcc ctctgggttc acaatacgtg tacacttgac tgtgaagtgg ctgtgagagt 18 0 

gggtggagag ttcttctttg accctcagcc tgcggatgcc tctagaaacc tcgtgttgat 24 0 

tgcaggagga gtcggaatta accctctgct ttccatcctg cggcacgcag cagatctcct 3 00 

cagagagcag gcaaacaaaa gaaatggata tgagatagga acaataaaac tattctacag 360 

tgcaaaaaat accagcgaac tcctgtttaa gaaaaatatc cttgatttag taaatgaatt 420 

tcctgagaag attgcatgca gtttgcatgt tacaaaacag actacacaaa tcaatgcgga 480 

actcaagcca tacatcacgg aaggaagaat aacggagaag gagataagag atcatatttc 540 

aaaagagact ttgttctata tttgtggccc acctccaatg acagactttt tctccaagca 6 00 

actggaaaac aaccatgtac ccaaagaaca catttgcttt gagaagtggt ggtaggaggc 66 0 

aagaccaaag gcaggaaaaa attaangagg tgagatctac tcaaggagag ctcaaaaaaa 720 

aaaaaaaaaa actngggccc tttaga 746 



<210> 479 
<211> 750 
<212> DNA 
<213> Homo sapiens 



<400> 479 

gnnnnnnnnn nngngnnnnn ttctannntt cntattnnct nggagctcgt tctttctnca 60 

ggatcccntg cgattcgaat tcggcacgag ggtagactgg ctagggatcc tggacccagg 12 0 

gttccacgta gcaacacctg ctgagttctc tgggttttct tcctgcctca tgtagcccag 180 

acttggagct gaagaagctg gaaacatgga aacaccaaca gctacagacc aaaaaaagtc 24 0 

ccaacaaagg cctgtcagtc tgccagcctg ttctgtggat ttccaactca agattgcagc 300 

atcaactcac acctgaagtt ctggcttccc tacaaacttt gaacttgcca gtccccacaa 360 

tggcataagc caattcctta aaatgaatgt ctagttctag ataatgtgtg tattctactg 420 

gttctgtttc tctggagaag cctactaata gatcatttgt cttagtcaat tcaagctact 480 

ggtacagatt accatagact gggtggttaa aactaccaat cttattactc acagtttttg 540 

gagtctggaa agtctgagat cagggttcca gcaggattga gttctttggt gaacatnctc 600 

tttctggnct acagaatact gggttacttt aagtnggaaa aagtagggtg aagctgggtc 660 

ntttggcctc ttcttttaag ggggactaat tcatgaaggg ttccaccctt attgacctat 720 

tttaccttnc caaanggntt ccattttccn 750 



<210> 480 
<211> 714 
<212> DNA 
<213> Homo sapiens 



ncacatngat tgaacactct 
agaagccngc gctggcctgg 
ctgcccngac ngaggccnaa 
ttngaaccnn nnagnggnnc 



ggccaagatg 54 0 

cttgnaagac 600 
ngnaaatggn 66 0 

tnnnngnacc 72 0 

747 
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<400> 480 



gnnnnnngnn nngnggnnnt 
gctaggcttg ngaattcggc 
ttatttgaac cctataccaa 
acagtgccct gcatgaaggg 
gtggctttgc tctctccatt 
tctactgttt ttgcctaaac 
taatattaca ttgttctgga 
ggacaatcag attatttcat 
agctcttctt tggtgttggc 
gactgcaata aagtatagct 
tcggtgaaat aaangcaaga 
tcgaccttta nactatnnga 



tcnaccgttc ttntgaccta gnctcgntnt nnccnacnna 
acgagataac acacatcaca gtatgctctc agaaatttct 
tatctgntga tcaatgacca tttttgctca gcatggagaa 
tagtgagaat aaaaaggatc ttaccacctt tatcatgagg 
ccaagttgtt ctctgttcta gaaagcagat gtagtagaca 
agaatccctt tttccttttt ttgttaaaag tactcatccc 
aggactgaaa ataacagaac tcagcaccat gatcggaccg 
tcctcancaa acggagatcg atccgaaaag tggaaatatg 
atatggaccc tgagagaaag aactttaatt ttttctcttg 
gcctaaaata cgtttcctga ccttggangt ttgnccacaa 
cgtacacttg gatgaaaaaa aaannnnnnn naaaaaaaac 
gtcgatacnt aatcngactg atagatcatt gnta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



<210> 481 
<211> 742 
<212> DNA 

< 2 1 3 > Homo s ap i en s 
<400> 481 

agccnttcta aangccnttt gnctnngnnc cccnttccta anncntggct aatncttggc 6 0 

nactcgttct ttctncacgc acccatcgnn ncgaattcgg cacgaggcat gaaaggagtc 12 0 

ggaagcggaa gcggtagccc ggacggtgct gtggtgcaag ggcttgtgga aaaattggag 180 

aaaaccaagt ccctggccca gcagttgaca agggaggcca ctcaagcgga aattgaagca 240 

gataggtctt atcagcacag tctccgcctc ctggattcag tgtctcggct tcagggagtc 300 

agtgatcagt cctttcaggt ggaagaagca aagaggatca aacaaaaagc ggattcactc 3 60 

tcaagcctgg taaccaggca tatggatgag ttcaagcgta cacagaagaa tctgggaaac 420 

tggaaagaag aagcacagca gctcttacag aatggaaaaa gtgggagaga gaaatcagat 480 

cagctgcttt cccgtgccaa tcttgctaaa agcagancac aagaagcact gagtatgggc 54 0 

aatgccactt tttatgaagt tgagagcatc cttaaaaacc tcagagagtt tgacctgcag 600 

gtggacaaca gaaaacagaa ctgaagaacc atgaagagac tctnctacat caccagaagg 66 0 

ttcagancca atgacaagac ccancaagca naagagccct ggggagccct ctgctgatcc 72 0 

caaanggcaa aaaatggggc cn 742 

<210> 482 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 482 

gnnngnggag nnttctaatg ctttgntcta gagtcncnnt tctaanggct tggnaatnct 60 

ngctcttgtt ctttntgcag gatcccatcg attcgaattc ggcacgaggc caagcctcgg 12 0 

cctccactgc acctgctgcg gagtggcacc tttgcctgca aggccctcta ccccatggcc 180 

cagtgtcatc tcagcagggt ctttggccac tcaggaggcc cttgtggtgg gttgctcagt 240 

ctgtccttcc ctcatgagaa gctactgctt atgtccacag accaggagga gctgtcacgc 300 

tggtaccaca gtctgacttg ggctatcagc agccagaaaa actagaggaa tcttatagat 360 

tccagaactc aggatacctc agggataggt cacagccaag agtacaaagg aatcttcagt 420 

actgaacaaa acagaaccct tcatgatttg acaaaggtca ctttctgttt gcctggacca 480 

agctactcca gatcatctga ccaactctta aaaatcacgg ccaggcacag tggctcatgc 540 

ctgtaatccc agcactttgg gaagcaaang tggcaggatc attccagccc aggagtttca 6 00 

agancagcct ggcaacacag tgagtgagac cctgtctcta tttaagaaaa aaattattaa 660 
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gaaattttat taaaaaagga agaatcagga aaccaaagtc aaccccaact taaccctcaa 720 
tgaaccagcc ctaacacaga tgangggatt tg 752 

<210> 483 

<211> 849 

<212> DNA 

<213> Homo sapiens 



<400> 483 

gnnnnnattn ccctttnaaa tnccncngaa ancccttgga agcactaccn ctcngacccc 60 

tttggaacgn cgactnctnn atatatcnng gatataatag gtgataagtt ctgncaatta 12 0 

gtaacatcng gaaaaaacag ctnngncctg ggngaaaaag gatgccaaaa tngcctggaa 18 0 

aagagcagng gagaggagtc cgggagatgn gngatgcatc gggacgcanc atngntnaac 240 

attcactggg tctgccaaaa atgtggattt gngggctgct tagatngtta caaggcaaaa 3 00 

ggaaaggaaa gagttctaga gataaaagaa ctatatgctt ggatgaagtg tgtgaaggga 360 

cagcctcatg atcaccaaca tttaatgccc aacccaaaat tataccnggt tctgntttga 420 

cagacttcta gatgccatgc acactcttag ggaaaaaata ttgggattaa ancccatngg 480 

cattggacta acaaacagga atttacaagg tnggaaantt ttncnaccaa tgaaaggggg 54 0 

gatcncaagg ttttccagaa nggntcntaa tcncaggnaa taaaaattnc tctngggcaa 6 00 

gccctgagtc ttaancagca aaaanactcc tcccgaancc tgnagaaaaa agggggggca 6 60 

gccaggcccn naaanggaan gtnaggcccn agatnaacaa ngtnacctcc ncccagnaaa 720 

ccccannccc caactggnac cngggnaacc cacaacnttt gcngaagncc aaaaaagncc 7 80 

nnnagangga aaaaaaaaaa naananaaaa aacctnnnag cccctaagaa accttagggg 840 

nggcccncc 849 



<210> 484 
<211> 1098 
<212> DNA 

<213> Homo sapiens 



<400> 484 

gnnnnnnnnt ttnnnnnttt ttgnaaaanc ccccttttgc naaatngncc ctttttntgg 6 0 

cangggatcc ccatntttat ntcggacatt ttcgggccac cggaaggggc cgggggcccc 12 0 

cgggccncca ggnccgggna aaggcccccc ttgggcggcc cccggncggc cccaatgggt 18 0 

tccaaaaagg gaaaaaaaaa aaagggggaa cctgggaagt tggcccanga aaangnaaaa 24 0 

aaaggnaagn aaaccttccg ccaatgggaa tggggaaaaa taattttttc ttgaaaaacc 3 00 

caaaaaagga atggttattt ttcaaattta aaaaaggaac nttgggaaga aagaattggc 360 

ttcccacncg cagaaagggc attactggct atgtcaagta aaagaagtcc ttcaaagctt 420 

agttgatgat ggtatggttg actgtgagag gatcggaact tctaattatt attgggcttt 480 

tccaagtaaa gctcttcatg caagggaaac ataagttgga ggttctggaa tctcaagttg 540 

tctgagggaa gtcaaaagca tgcaagccta cagaaaagca tttgagaaag ctaaaattgg 600 

ccgatgttga aacggaagag cgaaccaagg ctntgcaaaa agagcttttc tttcactttc 660 

gagaccaaag gggaaccagc tnnaagggcn agaaaagttn gaaaaaaatt ccaaaggaac 720 

tggtggaatc ccccaaaagg tttggttggg gaaagaaaaa ttcccgcccc aangccaaaa 780 

tttaaaaggt ttngccccca aagggaaaag ncttgncctt taacccagga attggggacc 840 

ctgggantta aaaccnataa ttttcccgcc naattnnaaa aaattcnttt nggggncccc 900 

naaaanggna aaaaaatttt nggggggttt tggnaaggna aaaatttnaa atttggattt 960 

ngaaactttt ttngggaatt ccccagaaag aacttttgac cttccnttng acctnaaaaa 1020 

ttttcccttg ggggggtgna anggatgttc ccaagctttg tggnatattg gtaaaatttt 1080 

naaccttttn tncttacc 1098 
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<210> 485 
<211> 798 
<212> DNA 
<213> Homo sapiens 



<400> 485 

gnnnnnnant nnnntttnaa atccttnntg aatcctttga antaccatcc cnttttncga 6 0 

attnggcacg aggaaaggtg gcgcgcttct cacggctgag ttgctgcgcc ttgcagacgg 120 

aagctcccca caggcagagc tgcttggatg tgtgagtcat gaaccagaga agccccgctc 180 

catgagcagt gactccccan gccctgtgac ctccctcctn cttgcagctc ctcctggcac 240 

cagtccccag ggctctcctg ttggtagttc ctgcttttct tcttggaaat tcctcgtgga 300 

cctcgagatc tttaccctaa aatagttctg ttgaatttca ccctggcaat gtaaattgat 360 

agcttatctt cacagatgcc agacaatgga caactcacca tcagtcctct gctcacctga 420 

gacaaatgca tgtctgattg cttcctctgc cctattgntt atgtgaaaat gcagattcac 480 

tgagccagac taaggcatca gtgactgttc ctctacctgc ctctcacatg gagattgtgt 54 0 

attcagtgaa aggctgatca aagacccaaa ggaatgcaac agtttatctc ttatctacct 600 

atgacctgcg aactggccaa caacccagtt gttgncgcct tttcagacag aaccagtgtc 660 

atcttacacg tattnaaatg gatgtcctgg ngtctnccta atatgtattc aaaagcaagc 72 0 

tggggcctng accacccttn ggcacatatt cctcanggac atcattcctg angctgtgtc 780 

actggcatgt ccttaanc 798 



<210> 486 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<400> 486 

gnnnnnnttt gaaanccctt tcnaatnctt ggcattgntc tctttgcagg atccctcgat 6 0 

tcgctgacaa cttgattggg ttctccttca ggtttgaagc gccctcgaga agtgtctaaa 12 0 

ggagacagtt gatagccaaa caacagtttt ggattcactg actgattatg aaagaagcag 18 0 

tagactggta tcaagaatca gtcagcaagg aggccctcac cagacgccag tgccatgttc 240 

ttggacttct cagcctccat attcatgaac taagtttttg gaatccttag gcttccacgt 300 

gtggaaagcc tgagctaacc tactggagga tgagccatca cctggagcag attcaggcca 36 0 

tcctagttga agcctcccta ggccaagcaa ccgtccaact accagacatt gaccattcag 42 0 

ccttgaacat tcagcacaaa gacaaaacag accagaccag aagagtccca cagaataggg 48 0 

gaaactattc agagaaaact taagccacta agttttatgg tgttttgttc tgtagcagaa 540 

gcataggcat actgacaata caaaccgaaa tccttctaac gtagtggacc ttttcangcc 600 

agcatttttt ccttgaaaac ctggagcatg tatccatctt atagcagaga tcactttcac 660 

aatggttggg ctcttggatt tgaattgatg atgtaatgag ccctctttnc ngattgnaac 72 0 

ttaattactc tgggnatttg ntggattccc aaccttctaa tatttacttt tcctctttan 780 

taanc 78 5 



<210> 487 
<211> 797 
<212> DNA 
<213> Homo sapiens 



<400> 487 

ttgtnnnncc cttttnaaat ncctttggct anttgntctn tttgctngat cccatcgatt 60 

cgaattcggc acgagnnngg actaccttnc aaaaccnggt ngggaagcnt gttacagaan 12 0 

tgatntctan tcccctgnat tctggatgct gcagaccaac acctgccnac aanacncana 18 0 
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cacacacann caancantat catgtaagac agnncgntna ntnnnnnatt ntnatncttn 24 0 

nncatttacn cantnttgta nantggntca tgngtctata natnnttgta antattntnt 3 00 

gananangac ganantctga atcttaagca tatgctccat cnttnnatat gctntggtgg 360 

agaggctngc cntnattcat nttnncatgg agncaagttt aatgcctcta gantacattc 420 

tgggcttcaa gcatncttat ttnnaactcc ctgagtgatg ggtggataaa tcnaacattg 480 

nctnagtggn ntcaagacaa ctttgntggt ggttttgntc acaatcatga aaatggttnn 540 

gccagataaa tattttgata ttagntttcn tttttnatat anngcggtag gtttgaattg 600 

nacnttnaaa tgnntngggt tgtnaagaca ntggnttnca atnnaattta tnacatgaat 660 

tggngnctcc cctttggnga aaccttaaag aanttntgna tacttcttca taaaagggtg 72 0 

tgngatttng naantttcgg gggttttnaa tttttnntga agcttatttc ntganaatnt 78 0 

acttggntta ccaagcc 797 



<210> 488 
<211> 762 
<212> DNA 
<213> Homo sapiens 



<400> 488 

caaatcnntt gctctngttc tttttgcagg atcccatcga ttcgcgacag ctctccaata 6 0 

ctcaggttaa tgctgaaaaa tcatccaaga cagttattgc aagagtttaa tttttgaaaa 120 

ctggctactg ctctgtgttt acagacgtgt gcagttgtag gcatgtagct acaggacatt 180 

tntannggcc caggatcgtt ttttcccagg gcaagcagaa gagaaaatgt tgtatatgtc 240 

ttttacccgg cacattcccc ttgcctaaat acaagggctg gagtctgcac gggacctatt 300 

agagtatttt ccacaatgat gatgatttca gcagggatga cgtcatcatc acattcaggg 360 

ctattttttc cccacaaacc caagggcagg ggccactctt agctaaatcc ctccccgtga 420 

ctgcaataga accctctggg gagctcagga aggggtgtgc tgagttctat aatataagct 480 

gccatatatt ttgtagacaa gtatggctcc tccgtatctc cctcttccct aggagaggag 540 

tgtgaagcaa ggagcttaga taagacaccc cctcaaaccc attccctctt caggagacct 6 00 

acccttcaca ggcacangtc ccccaaatga gaagtctgnt acccctcatt tcttnatctt 660 

tttacttaaa ctcaagaggc agtgacaggn agtcaggggc aagacattac atttttcata 720 

ctttcccaca tctgaaaaga tgacagggga aactgcaaag cc 762 



<210> 489 
<211> 822 
<212> DNA 
<213> Homo sapiens 



<400> 489 

ttnnnnnnct nnnggnnttt cnaatncttg tttctcgncc tttctgcagg atcccatcga 60 

ttcgaattcg gcacgaggat tttcgaaact cttcagctac ttgccctttt ttatctgaaa 120 

ccatcatacc ttctgaaaga aaaaagcata tcttcattga cataacagaa gtgagatggc 18 0 

ccagtcttga tacagatggt accatcntnt atatggagag tggcattgtg aagataacat 24 0 

ctttagatgg tcatgcatac ctctgcctgc ccagatctca gcatgaattt acagtacatt 300 

ttttgtgtaa agttagccag aagtcagact catctgcagt gttgtcagaa acaaataata 360 

aagccccaaa agataaacta gttgaaaaaa ctggcaaaat ctgtatacgt ggaaatttac 42 0 

cangacagag actgaagaat aaagaaaatg agtttcattg ccagatcatg aaatccaaag 480 

aaactttaaa gaagatgagt tgtgtaaatg gaactgaagg gagggaagag ctgccttcgc 54 0 

ctggtacaaa gcacacatgt gtatacacat gggtcaagca gtgctggtct gtggctgcct 600 

gtccagagga atgggaaata ttcctttgtc tttagcactt catttttcta aataaaaatc 660 

anccaatatg tctaaaaaaa aantttnttn ataataaacc tngaagccct nttanaacct 720 

tntnntggag gtcctnnttt accntatgat tcccggaact tggataagga atcccntttg 78 0 
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gattgganat tttgggccna aaacccncna nncttggaat cc 82 2 

<210> 490 

<211> 789 

<212> DNA 

<213> Homo sapiens 



<400> 490 

ntgtaancct tttcaaatcc cttggctact tgntctttct gcaggatccc atcgattcga 6 0 

attcggcacg aggccggacn gtgactctgg nnacgcttgc gnccntnacg tagntngnng 12 0 

accntgcang anggaanaan ggctggccnn cngntgtacn ctnaccgtcc taaccccgcg 18 0 

aggtccaggn ccgctccttt cggngnggat tctcgcggaa natccctccg gcagctcttt 24 0 

gcaaagctgn ttagaaactt ctcccaaact cggcntggat acgactgcta tagggctcgc 300 

tgctgctttt gtggagctct tgctcctcta tccttggcct ctcctgggat acggcccaag 360 

gccaagtntt cacgcangtt ggtacgctta tttcgttctg gactctgggg gctntgaann 42 0 

ttcaccacgt ggactgctgg ggancgggnt nccgancact ngnntacctt acnccanaat 48 0 

ctgacaactt ttctggacaa cctacccanc ttcaattggc tngngagcnc ntcngntgct 54 0 

ggggnntncn gtgcaaatgg agncncaatt ggtgggcaaa tngttgatgg ncaaaacggg 6 00 

aaaaagcaac nnncaangct tttggctnaa agccgatang acncaaatta nttnctttgg 660 

accttganaa tttcctcaan nnttttnagn anncnctttt ttncttggan aaanacttaa 72 0 

aagtgaacga ttnttgggaa anaaacaaac tataataact naaagctttt ntaaaaaaaa 78 0 

annaatnnt 78 9 



<210> 491 
<211> 790 
<212> DNA 

<213> Homo sapiens 



<400> 491 

tccaaaatnc ccttggantn attccccctt ncaatacctt tccttngnac actcccngtt 6 0 

tngntngatc ccatcgattc gaattcggca cgaggnaaca aagaaggaat gtcttcctca 12 0 

tgtttnggtc tatagaagac gttaaagaaa acttccagaa agtgggtttg aggcatgagc 180 

caccacgcct ggccaaagga tttaatgaat taatggatgt acagtgctgg ggctgttatt 240 

ctagggcctg cattgagact cacattttgc catcaaaagc cttttaagag gtggaggttg 3 00 

cggtgagctg acatggtgcc actgcactcc ggcctgagtg acagagtgag actctgtctc 36 0 

acaaaaaaaa taatgccctt taaataatga ataatagtga tagaaaatgt catttcttgg 42 0 

acaaatgaaa aattgaaatt aatgtatata attagatatt attagctact cttaggtagc 480 

ttcatttgtt gaaagtttga caagtgaatg aagttcacat ctggaaatcg ttgaacattt 54 0 

ttcgttcatg gaactcaatg gctacgttag tcgtttatgc ttttcactgt tgtggtaggg 600 

gctttggaaa gtnaatgcca tcaacaatgg atacagaang acctggattt ggaataaggg 660 

caaaaattta ttttgatggg gctgaattgc tctgccaggg agcattttgg gtattgagat 72 0 

gaaaatggcc tctctttgag actgagctgc cacctggcaa attattgnct gcttaanggt 780 

tctctttatn 790 



<210> 492 
<211> 804 
<212> DNA 

<213> Homo sapiens 
<400> 492 

tcnaaatccc ttttgnnagn ttcncncttt gtttcccttt nctnggctnc ttgttctttt 60 
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tgcaggaatc ccatcgattc gaattcggca cgaggtcctt ttgaaccacc ccaaagaact 12 0 

caacatggca aagcaaatgg taaaagcttc ccgactgttc tactttgggt ccgcgcgaag 18 0 

cccactcacg tgtgatctgt gttgcccctg ggaggcccgg ggcgaccgga aaagggctct 24 0 

ctcaagttct gaaaagagaa tctgccacca gatcgaattt cgacccctga gcttgttcgg 300 

acgtatggtc caaattcaga ttaaggtggt cacccaaccc gagatgtcag gaaaggcctt 36 0 

ctgcagagaa aatgtccccc cacccgccat ctgcagccag gtgtgtgcca cacggcagcc 42 0 

ttcccgaaac atagtatgga ttttaaaaat gtgtttattt ttgtttctca accactttat 480 

aacgtatttt ttaatttatt ttgtaatgtc ttgttttgaa gtattgctgc tatccttgnt 54 0 

atccttccca ctgtttttat cactgattta ttttgtgaaa agttgtacac taatgttcta 600 

tgtcaaaatc aaaaagtatt taatgaaata ctagttctat ttaatgtggg ntatggaacc 66 0 

ancttggaaa cacaaaacaa acaggggatt gtacaagcan gcttggggcc caagnaaggt 72 0 

caaggttcat ttggttacca tatgccnata aaacctcanc gaanttttaa aaaaaaaann 78 0 

nnnnnnaaaa aancttggng ggct 8 04 



<210> 493 
<211> 800 
<212> DNA 

<213> Homo sapiens 



<400> 493 

ggnncnnttt nccccccttt tgaaaacccc ttttgggnga ancccncttc tttnaaatcn 6 0 

cttggctact cgctctttnt gcaggatccc atcgattcga attcggcacg agtatataac 120 

aacttttgct ttcaaagttg ggtgggacta gaacacacaa tggaaggatg gagtcaggag 180 

acctggattc ttgtgcccgc tctggctttt acagtctgcc taactctatg cagtcacttc 240 

ctgccagcct gtttccttac ctacaagagg gagagacact ccctggccag cctagttctc 300 

agggtgaacg aaaggtcatt atcactgcat cctctagtca tttgcttctt cgctaattaa 360 

cacatcttga gcacctgcga tgttccagga acaggagatg gcagcgtgca agataaaagt 420 

ccctgacttc tagagactgc atgttagtgg caatcggcgt ctacccggcc ttcaataaac 480 

tactgaatga aggaaaattc tacctagcac cagacacaat tactgggttt ctaaaatgga 540 

attattcccc cggccccctg catccagcag cctgctgcag ggaagctcct ccgaagctgt 600 

aggcaggagc gggacaaatg cttgctatca gcttcacaga atgttaccta agtactattc 66 0 

ctacacagcg ccttacagaa caaacagtaa aaaccaaatg gnaagcatgc acnggcttaa 720 

aaactcaaac ttcctaacta ctcagtaatt anganggtca ttttacccca aaatagaatt 780 

ttcnatttat ccaataanaa 8 00 



<210> 494 

<211> 757 

<212> DNA 

<213> Homo sapiens 



<400> 494 

n ggnttcnnt ctaactnaaa cngttnggna actcncctct ntctgtngat cccatcgatt 60 

cgctaacaag cgattctaaa ccacctatga gtatttcttt tagggctcac ttaaatacat 120 

gtttgtatat actgtattct agccagaata attttagatc tgatcaggta gtagctaaaa 180 

ttagaaaaaa acaaaataga tgcttaaaga atttgcatcc atttttgagt ctaaatcttt 240 

taaaatatac tgagatccac atctagtgaa atgtcagtgt caaaatatta tagattatag 3 00 

ctaaaatcca gattaatact catttggggt tttttatagt ggaacttcat agtaatacaa 360 

aaagcagatt gtcttcctgt ctccgctgct cccacagtag gtattgaaac tggtaaaatc 42 0 

agttttttga tagtgtgtgt atataagaaa aaatagatac acacattctt ttttctcagt 480 

caacacattg attgaacact ctggcaaaga tgctgtggtg gatgangttg gagttcgaaa 54 0 

agaagaagca agcgctggcc tgccttgaaa gaacccgaaa gtctttccca ttcacttctc 600 
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tagaaagctg ccaagacaga ngcagaaagg aaatggatga tagttctgtc aagcacactt 660 

ctgntctcnt agaacttaga aatggttcta agagaacaga agttatngag aacagttcnt 72 0 

gtggaattca acatcttggg tgggacncat tggcttt 757 

<210> 495 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 495 

ggnnnnnntc ttttcnaatg cttggctctc gttctttntg caggatccct cgattcgcaa 60 

gagagagtga tagaattggc agtgaaatat acgaaccacc ctcctgccct ctgggttcac 120 

aatacgtgta cacttgactg tgaagtggct gtgagagtgg gtggagagtt cttctttgac 180 

cctcagcctg cggatgcctc tagaaacctc gtgttgattg caggaggagt cggaattaac 240 

cctctgcttt ccatcctgcg gcacgcagca gatctcctca gagagcaggc aaacaaaaga 3 00 

aatggatatg agataggaac aataaaacta ttctacagtg caaaaaatac cagcgaactc 360 

ctgtttaaga aaaatatcct tgatttagta aatgaatttc ctgagaagat tgcatgcagt 420 

ttgcatgtta caaaacagac tacacaaatc aatgcggaac tcaagccata catnacggaa 480 

ggaagaataa cggagaagga gataagagat catatttcaa aagagacttt gttctatatt 540 

tgtggccacc ttcaatgaca gactttttct ccaagcaact ggaaaacaac catgtcccaa 600 

agaacacatt tgctttgaga agtggtggta ggaggcagac aaaggcagaa aaaattaaga 66 0 

ggtgagatct actcaggaga gctcaaaann aaaaaaaaaa aaactnggac ctntagaact 72 0 

atagtgagtc gtnttccgta gatccagaca tgataa 756 

<210> 496 
<211> 744 
<212> DNA 

<213> Homo sapiens 
<400> 496 

ctttnaatcc cttgcactcg tcttntgnag gaccttatcg attcgaattc ggcacgagat 60 

aacacacatc acagtatgct ctcagaaatt tctttatttg aaccctatac caatatctgt 120 

tgatcaatga ccatttttgc tcagcatgga gaaacagtgc cctgcatgaa gggtagtgag 180 

aataaaaagg atcttaccac ctttatcatg agggtggctt tgctctctcc attccaagtt 240 

gttctctgtt ctagaaagca gatgtagtag acatctactg tttttgccta aacagaatcc 3 00 

ctttttcctt tttttgttaa aagtactcat ccctaatatt acattgttct ggaaggactg 360 

aaaataacag aactcagcac catgatcgga ccgggacaat cagattattt cattcctcag 420 

caaacggaga tcgatccgaa aagtggaaat atgagctctt ctttggtgtt ggcatatgga 4 80 

ccctgagaga aagaacttta attttttctc ttggactgca ataaagtata gctgcctaaa 540 

ataccgtttc ctgacacttg gaggtttgcc acaatcggtg aaataaaggc aagacgtaac 600 

actggatgaa aaaaaaaaan nnnnnnaaaa aaactcgagc ctntagaact atgtgatcga 660 

ttcgtagatc cagaatgata gatcattgtg agtttggaca accacactng atgcagtgaa 720 

aaaatcttat tgngaattgn gatn 744 

<210> 497 
<211> 772 
<212> DNA 

<213> Homo sapiens 
<400> 497 

gnttgnngtn taantttnta aggatccctt tntntgaanc cctttctgca ggatcccatc 60 
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gattcgaatt cggcacgagg caggagnaat cacttgaacc ctggaggttn cggttgcagt 12 0 

gagcacagat catgccactg cactccagcc tgggcaacaa aacgagactt cgtctcaaaa 18 0 

aaaaaaaaca tagaatttgg atcctttggt cgggttctcc caaattcttt tgaggtgtcc 240 

atggtcaact gcttcagctt tgttttggca accccctgcc cgaagtcgca tataggctgt 3 00 

tcttcacctt gtttccaagg ctgaggaaca gaaagtagcc tctgttttga ggaggtggaa 360 

gttaagtata catttatttt ttactgtgac ttgttcagga ccacatttta caaaatgcct 420 

tgtttccttc attgtttctg gaaaggaaag ttctattaat attgntttac tttgaatata 4 80 

gaatagtttt tttaattagg gcttattttg aaaaattctg agtttaattc aaatgtatgc 540 

caataccttc caaagtaagg taatattcag agacagttgt tggtgatcag atggcttaga 600 

gaaaatttct ggaatattca cattcgaaga tccttattat gaatgtcttt gacttaaatc 660 

taaccaaaaa ctgcacatta ttctttgnac attttcatta tatagngtta acaagcttan 720 
ttgcaaacca ataaatactt aagctattta aaaaaaaaaa aaaaaaactc nc 



772 



600 
660 



<210> 498 

<211> 773 

<212> DNA 

<213> Homo sapiens 

<400> 498 

nttnagcnta nnagccgttg tantgaagcc cntttgctac ttgctctttt tgcaggatcc 6 0 

catcgattcg aattcggcac gaggacccag gtagaccagc tcaagagttc atgttctttg 12 0 

tcatcctcct gtgagctctc tgtaagtctc tntcttgccc atcaccacat ccctagtact 180 

gggtatcagt ctggccactt ggctttctgg tttgccccaa tgtggtctat tcttgatgca 240 

gctaccaaag taatgttnta aaaccattat accaagttac tatccttgtc aaaaccccca 3 00 

gtaactgcca atctcactta gaataaaatc cggactcctg tgaagcacag nataaactgg 360 

cactgcctat gcagcaacct catctttacc gtttctgcct tgctcactcc cttcagcgcc 420 

ggtattcttc ctgatgcccc tagtacacaa caactccttc ctgctccaag agtaggaaaa 480 

tnactgtctc tctgccagtg agattcctct tctggtatta cctntgcttc attgctgaat 540 
cttctgcaat atcatcttct aaaaagagcc tttnaaaatc accttttcta ttatgcccta 
ctcantttcc agtccctgaa tggccattcc ccactttcat agccacttaa ttgctatctg 

aaattacact taaaatggtc accttcatga tgggaaggca attaattgcc tttgtcactg 720 

gtatgtctag agaacaagca gnttggctca tagtaggcac tcaacaaaaa ttt 773 

<210> 499 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 499 

gcttcaatan ctttttctaa ngctcttttt gcaggattcc atcgattcga attcggcacg 60 

agagtaccca nanttgcnag gagtntnntn actgatntag ccaggtggca atnatgagtg 120 

aatggatnaa naaaggcccc ttagaatggc aagatnncat ttacnnagag gtccnagtgn 180 

canccagtga cangaatgag tttnaaggga tgggttttaa ctacagaccc agnctctgcc 24 0 

aatatngacc ttgtgaactt ccttgaagat ggcancatgt ctgagaccgg aattatggga 3 00 

catgctgtgc agactgttga aactntgaat gaaggggacc atagagtgag ggataagctg 3 60 

atgcattttg ttcacgtctg gagactgcaa agcatacagc ccacaggatc tggaagagag 420 

aaagaacagc ctanagnaaa tggctngaga ngaaccacat tcccatcact gaacagggan 480 

acgcttcaag gactctctgt gtggctgggg ncctgactat ngacccacca tatggtcana 54 0 

naaattncac cagctctnat gagantattn tgtcgcgtgt tcaggatctt antgaaggac 600 

atcttacant ttnccaanna naagncatga aatgtgacat tctgcttgaa naagacnata 66 0 

ttttatcctc atnaatgttt aaatgtaaaa nnnnananaa aanactcgag ctntnaaatn 720 
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tngtgagttn anang 

<210> 500 
<211> 926 
<212> DNA 

<213> Homo sapiens 
<400> 500 

tttaagccct ttctactnct cttttgcagg attccatcgn ttcgaattcg gcacgaggat 
ctctatacta gtgaacagtg ccagttccac actttggact tagaactgtt ctctagttat 
tgtaacacag aatactgtca atccctaatt tacttaatgt tacttattgg aagtggggct 
gatgaaatac gcacaggagg gaaatctact gtgtttaggc acaggcagnc ccagtgtata 
aggagatcat attccaaang gttgtcagtt ggntgtttgc aacctggaat gtattttcct 
ttagagacca ngttatccat ggtggttagg cccctagagc agctggaaaa agatgatcaa 
accaataggt tngctgacat cnaataatgt aataagtttg ctaaaggaat ctaccatcaa 
atntnatatt gnttccaggg aaggttgtnn nttaanntnc cntcttngtg ncatantgga 
cnntcccntn ccagtcatnt ncntnannnc tngggcnngt ntngnnttng tntntttngn 
cnnctnanca atatttcata tcncccctng ctaaaattct ttnanannaa nttctcantt 
tctcccttta ctanaanttt ngtntttnnt ccntttanta tttnnnccta tntntntcgt 
tcnnanatnt cattnnntnn ttntnngctn ntnnatcacc cttanctcnn tctcanntat 
cntnntcnta ttatctctnt attnntcnct tntnatnatc nttccnnntt gtntanncna 
ttatntcttg ttnntntnct cncatctctn tcntnttctc ngctnannnn actccnnnnn 
tcncnctcnt nnnnanatnc atatnctnct ttngntatat annnnnntnt ntacntanct 
cnnnatnnca tnncnatatn nttngt 

<210> 501 
<211> 706 
<212> DNA 

<213> Homo sapiens 
<400> 501 

naatncttgg ctcttgttct ttntgcagga tcccatcgat tcgaattcgg cacgagaatg 
caaagggctg cagttctcat tcaggctact ttcaggatgc acagaacata tattacattt 
cagacttgga aacatgcttc aattctaatt cagcaacatt atcgaacata tagagctgca 
aaattgcaaa gagaaaatta tatcagacaa tggcattctg ctgtggttat tcaggctgca 
tataaaggaa tgaaagcaag acaactttta agggaaaaac acaaagcttc tattgtaata 
caaggcacct acagaatgta taggcagtat tgtttctacc aaaagcttca gtgggctaca 
aaaatcatac aagaaaaata tagagcaaat aaaaagaaac agaaagtatt tcaacacaat 
gaacttaaga aagagacttg tgttcaggca ggttttcagg acatgaacat aaaaaaacag 
attcaggaac agcaccaggc tgccattatt attcagaagc attgtaaagc ctttaaaata 
aggaagcatt atctccacat tagagcacag tagtttctat tcaaagaaga tacagaaaac 
taactgcagt gcgtcccaag cagttatttg tatcagtctt attacagagc tttaagtcca 
aagatatcaa atatgcacgg gctgcacact aatcagtctt ctatca 

<210> 502 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 502 

ttnntttttt tggttaccct ttgctctngg nctttttgca ggatccctcg attcgaattc 



735 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
926 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
706 
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ggcacgagcc ttccacggtt atttcacaga tatggagagc tggaagcagg gagtgagtct 120 

ctgagtgttg gaattgtaag ggatcagaag cagggatcag aagcagtggt gaagttcatc 180 

caccataaaa cacacaggtg actttgcctt gaatctgcag gactgaagcc aactcttggg 24 0 

cacagaccct tagtcccttc cttggccact ctaagtcaga tagtccagag ccaggccctt 3 00 

tgggatgtga caccgagata aatcagagaa aagctgtgaa gcttggggaa cagagggact 3 60 

tttggtgaag taggtggtct gcagtttcta tcttcttggg aaaagcaagc tggaaaagtg 420 

aacagtggtt ggtaggccat agtgctccca gctgggtgac ataatgacca cacagcacag 4 80 

tgatgttatt agcaactgtg tggtggagta gttgtgggct ggacaaatca atcgtgtgga 54 0 

aattgttagg agttttatta cattaaactt gttaacctaa aataccatca aaaaaaaaaa 6 00 

ntncnnannn nccncccacc nancntncna aaaaaancct cganccttta aaaacnnntn 660 

gnngaggccn tatttacgtt anattccaga cnttgaatan ggatnccatt tgnattgaaa 72 0 

ntttngggcc aaacccccaa ccttngaatt gccattngaa aaaaaaatgc cttttatttt 780 
gnnt 



784 



<210> 503 
<211> 764 
<212> DNA 

<213> Homo sapiens 
<400> 503 

ttnntnttcc ttgaancctt tttctacann cncctttgca gatcccncgt tcgaattcgg 60 

cacgagagac aaagaaaagg tggcaatcat agaagagttt ntagtaggtt atgaaacctc 12 0 

tctaaaaagc tgccggttat ttaaccccaa tgatgatgga aaggaggaac caccaaccac 18 0 

attactttgg gtccagtact acttggcaca acattatgac aaaattggtc agccatctat 240 

tgctttggag tacataaata ctgctattga aagtacacct acattaatag aactctttct 3 00 

cgtgaaagct aaaatctata agcatgctgg aaatattaaa gaagctgcaa ggtggatgga 360 

tgaggcccag gccttggaca cagcagacag atttatcaac tccaaatgtg caaaatacat 420 

gctaaaagcc aacctgatta aagaagctga agaaatgtgc tcaaagttta caagggaagg 4 80 

aacatcagcg gtagagaatt tgaatgaaat gcagtgcatg tggttccaaa cagaatgtgc 540 

ccaggcttat aaagcaatga attaaatttg gtgaagcact taagaaatgt cattgagatt 600 

gagagacttt tataggaaat cactgatgac ccagtttgac tttcatacat actgtatgan 6 60 

ggaanattac ccttagnatc ttatggtggg actttattta aaaacttnca nnaatgttcn 72 0 

ttcgacagcc ttccatttta acttcnaagg cnncaangaa ttnt 764 

<210> 504 
<211> 795 
<212> DNA 

<213> Homo sapiens 
<400> 504 

ttgtacntct ttttnnaaac ccntngctac ttgttctctt tgcanggatc cctcgattcg 60 

ggaatctcct agaaagttgt gattttcgag ccatatcctt ctgtggtaga tcctaatgat 120 

cctcagatgt tggccttcaa ccccaggaaa aagaactatg atcgagtaat gaaagcactg 180 

gatagcataa cttctatcag agaaatgaca caagcaccat atctggaaat caagaagcaa 240 

atggataaac aggaccccct tgctcatccc ttactgcaat gggttatatc aagtaataga 300 

tcacatattg tgaaactgcc agttaacagg caattgaagt ttatgcatac tccacatcag 360 

ttccttcttc tcagcagtcc accagccaaa gaatccaatt ttagagctgc taaaaaactc 420 

tttggaagca cctttgcatt tcatggctca cacattgaaa actggcactc ctcctganga 480 

atggtctggt ngttgcttct aatacacgat tgcagctnca tggngcaatg tatggaagtg 540 

gaatctatct tagtccaatg tcaagcntat cattttgntt actcagggat gaaccangaa 6 00 

acagaaaggt ntcagcccag gacgagccac cttcaagcng ttaanaagcc agcaattaca 660 
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ttcacagtcn ccaggaaana aaaggncagn cctatccccc ctttncctgg caaaaggccc 720 

gtnaacctta aanaaactgc ctttagccct ttatnntgga aagtggattc ncncttnatt 78 0 

cttggacccc tgncn 795 

<210> 505 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 505 

tnntntnntt nantngaacc ctttnctctt gctctttttg caggatccct cgattcgaat 60 

tcggcacgag cacaaggaga agaagttaat taacattgaa ngatgagaag acatcttgga 12 0 

agaacttgaa ttgggccttg gaagaagaac agccattcaa atagatagaa ttgtggtagc 18 0 

aaaggcatag aggtaggaaa gtatagatct ccagggacag tagtcatggg gttggggcac 240 

tgttggaatt taaggttgga aggatatatt ggagcccctt gaatacggta acaaggcaca 3 00 

ccttgggcag tggagagtta tcagagtgtt tgaaaaggag ggttattgag taaataaata 360 

gactggtact ttaggaattt taaaatgtgg atcattgtac tactaataac tatttatttt 420 

atatttacta tctactaagt aatttacatg tattttcttg tactgactgt aaaccttctg 480 

ggtgtgggtg ttttaagtgc cattttactg atnaagaaac tgaggcttaa atagttgaaa 54 0 

taagtcaccc tgttagtgag tggccagaat gacaagtcag atctanggtt tgtctaactn 600 

ccaaagatria tataaaaata atggatctct ccttttccct tatgcataaa atatggggag 660 

cntttttaaa tcattaccca tncgattgnc caaaaaaata cctttnggga aaactgatta 72 0 

ttantattcc anaataaatt tcaacggcct gcntngnctn ctttacaact ttnt 774 

<210> 506 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 506 

gccnccccnn tttngntctc aacttgtacc ctttttgcan nancncgnnc tncttgcagg 60 

ntcccatcga ttcgaattcg ccacgangtt atattaaatt attctttgtt tttctttttc 12 0 

ttttaataaa gcctgcaagt tactaaattg tagtttcata aattctgtag taaagtatca 180 

tcttggcagt gtgccaaagg tgaaaatgat gctttctcta acagagaaat tcttagtgac 240 

tccagtcgta gaaaaacgtc tttacaacct gaataagatt gaagaattgt gaacatacca 3 00 

tggcctattg gatgaatcat ttgccgtagg ctaaatcaga ctgtagggtt tgtgatggat 360 

ttatggagta tgtgggtata gaaatcatga atctagcatt tgttttcaga gattcaagca 42 0 

tagtcttaag ggtanatcag aaatgacaaa tgaattcaaa acctagcagg tgcattgtna 480 

atgtgtgccc agttntgttt tggaaatggc agttccttgg ggtcatgttt ctactggcaa 54 0 

aatttgcaat antgtnctat tgtntgtaat ttcaaaattt ataagattat cccccgttcg 600 

cccaagtaaa acctgtnctg cccaatanaa tcctggantc gnngagaaat cgcntccatt 660 

cgnngntcaa ctcgggatnc ntcgncttaa naaaatnttn tccnggancc ccntcatnan 72 0 

gaanaacacc anactattnn gggnacctgn aangctcaat ngcccnngcc ncnnangncn 780 

nttttccngg naannn 796 

<210> 507 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 507 
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600 
660 



ctnntttntt ttngaancct tngctcttgt tctttttgcg gatcccatcg attcgtgaag 60 

aggagacggt gacctgggct ccttatgtgc ctgaaagagt ttgagtttcc tgttaactcc 120 

aaatcaacag tattttcaac aagaaatgtg caattgaaat caagtgctgt ttaagtgcag 180 

ctaggatttc cacaggaaga cacttgcagt gaacagagtt atggagcagc aaaaacacag 240 

atctatttgg aaaaagagaa aacatatgcg ttgtattttg cttcaattat aaaataccat 3 00 

cctctcaaag gtggttctaa attacaaagg actttgattt ctaggtagat tctgggtaga 360 

gacttccttt catattgagg cattaatgac accttttaac ctgggaagca atatgactgg 4 20 

agttgtactt tgagaagatt aatcaggttt ggttgcagaa tgaaagagaa gatgaagtca 4 80 

agagattggt ttagaggctc tagcagaagc ttagtcatat ttcaaaatga tcaaatatca 540 
agaaaaattc tgagctgcat aacttgtata aagtaatttt cagtgatttt ttcatggtta 
tgatnaaaga actggattta nccagaaacc tttacctgga ttcaagattt aatttttcct 

ttgagcctca tccttaaagg attttcggga aaacattaag gggagccaaa nccnattggn 720 

tggttgggcn tgccctnnaa ttgcctttgg acttttttaa ccgggctttt gnnn 774 

<210> 508 
<211> 724 
<212> DNA 

<213> Homo sapiens 
<400> 508 

cttgcctttg aaaancgttg gctactngtt ctttttgcag gatcccatcg attcgaattc 60 

ggcacgaggc ggcgctgacc cggccggccc cacacccgct cttcctcttc tttgccgcgg 120 

actccctttc ctgcctccaa gacctggtgt ctcccactgt gagcccagct gtcccacagg 180 

cagtccccat ggacctagac tcaccttccc cttgcctcta tgaacctctg ctgggcccag 240 

cccctgtccc agctcccgac ctgcacttcc tgctggactc aggcctccag ctccctgccc 3 00 

agcgagcggc ctcagccacc gcctcccctt tcttccgggc cctgctgtca ggcagctttg 360 

cagaagccca gatggacctg gtgcccctgc gaggtctgtc gcctggtgca gcctggcctg 420 

tcctgcatca tttgcatggt tgtcgggggt gtggggctgn nntggggccc gtgcccacac 480 

cangcnancc cctgtatggg atcanaggcn cgaagangca ntgnangctg ntggcanntn 54 0 

aantactgnc tgggctggaa nangaactnn taaaagtcnt ngcccnnatc caccttggna 600 

cccnannttn nnccrintant cnnngggntn angtggtnnn nnctngggac agntcnntnt 6 60 

ggnntgncna tngnncnnat gnanacttgg ggttcannaa ncntttccnn atgnaancng 720 
ngtc 



724 



60 



<210> 509 
<211> 803 
<212> DNA 

<213> Homo sapiens 
<400> 509 

tnnnnnttta tttcnttcgt tctngntttt attacatcag ctcttttctt tttgcggtcc 
ctcgttcgca attcagagac acacataaga aactggaaga agagaaaggc aaaaaggaaa 120 
aagaaagaca ggaaattgag aaagaacgga gagaaagaga gagggagcgt gaaagggaac 180 
gagaaaggcg agaacgggaa cgagaaaggg aaagagaacg tgaacgagaa aaggagaaag 24 0 

aacgggagcg ggaacgagaa cgggataggg accgtgaccg gacaaaagaa gagaccgaga 3 00 

tcgggatcga gagagagatc gtgaccggga tagagaaagg agctcagatc gtaataagga 360 
tcgcagtcga tcaagagaaa aaagcagaga tcgtgaaagg gaacgagagc gggaaagaga 4 20 

gagagagaga gaaccgagag cgagaacgag aacgggagcc gagagagaga gcgagagagg 4 80 

gaaccgggag cgagaaagag aaaaagacaa aaaacgggac ccgagaagaa gatgaagaag 54 0 

atgcatacga accgaaaaaa aaaaaaaaaa aactcgagcc tnttaactat agtgagtcgt 600 
attacgtaga tccagacatg ataagataca ttgntgagtt tggacaaccc ccacttgaat 660 
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gcagtgaaaa aaatgctttn tttgtgaaat tttgngatgc tnttgctttt tttgtaacca 
tttttagctt gcaataaaca agtttrtccac caaccanttg cnttcatttt nttntttcan 
gttcaagggg aagtttttgg aag 



720 
780 
803 



<210> 510 
<211> 789 
<212> DNA 

<213> Homo sapiens 
<400> 510 

gntttnnnnc nnttttaatn tacatacanc tacttgttct ttttgcaggg atcccatcga 60 

ttcgaattcg gcacgaggga acccccacca ttaagctaaa gtaaaaccct tttgagggaa 12 0 

gagggagact ggggagaagg gaaaagagag aaggcaggga gagtagggag agaaaacctt 180 

ccagcagccc agtaaactgc gggcgaagag atctacccgt ctccctccct cccacagtta 24 0 

ccattggcct tgtcatcgca agcatttgac aaagacttgc ttgtttgggc ctgtcacctc 3 00 

ctgaaaggct gctttagctg tggatgccct tgattaaggg agagagcgcc taggagctgc 360 

ctgccccanc tggggtgacg gctgtagggc tgggtctatg ttgcaagccc tatatcctan 420 

catgcagtgg aaagtgctta gctctctccc tcctgacctc tgggcagcca gtcatcaaag 4 80 

cagagagacg tggcggcatg tgggcagcat gcccaggttc cttgctgact cagcacttat 54 0 

ttctgtagtt ttaaaaaaga atttaatgtt tttggttgta tttttttggg ggggtgaggg 600 

tgggcaaaaa catgggggta gttctgagtt gttagaaatg tttctgaatc aagtttgttt 660 

gaaaacacgt tgtgcctttg tacccattat aagatggtca taanacccaa gaactgataa 720 

gctttgggtt ttttttggtt tggtttggtt ttttgcttca ttttacccat tcatgcctag 780 



<210> 511 
<211> 776 
<212> DNA 

<213> Homo sapiens 
<400> 511 

catanagntc ttgccttttt gnaggacnct cgattcgaat tcggcacgag cccccatctt 60 

cactggttat tccacttatt taaaatgtcc agaataagca aatctccata tagaggaagt 120 

agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc ttttgcagtg 180 

ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata tgcttaagac 240 

cattgaatta cacactttac gttggtgaat tgtatggatg taaattatag ttcaataaca 3 00 

tagttacaaa agataatcaa aagcatgaaa gcactgttga tgtggnttgg atctgtgtcc 360 

tcaccgagtc tnatgttgaa atgtaagccc cctggtggga ggcgatggga ttatggggca 420 

gantcctcac aaacgggtta gcccacccgc tcaggctgtt ctcctgatat tgagtcctca 48 0 

tcacatctgg ttgcttcaaa gtgtgtggng ccttccctct atctcctact gctctggcca 540 

tataagangt gcctgcttct ccttcgcctt ntacatgatt gtaaagtttc ctgagcctcc 600 

tagaacnaaa gctgctgngc tttctgtcca tctacangan cgtgagccca attaaacctc 660 

tttttttttt ttnngaggnn ntttnntnnc nntccnnnca ntttnanann cctngnanng 72 0 

gtttnnaaaa anaananngn naannnnnnn nncccccngc ccttttaaaa taaaaa 776 

<210> 512 
<211> 917 
<212> DNA 
<213> Homo sapiens 

<400> 512 



ggtttccat 



789 
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ttatttcata aactattgtt ctttttgcag gatccatcga ttcgaattcg gcacgagggc 60 

tgcgaggttt tcggctttgg ctcctgatat gcagcgacag aattttcggc ccccaactcc 12 0 

tccttaccct ggtccgggtg gaggaggttg gggtagcgga agcagcttcc ggggaacccc 18 0 

gggcgggggc ggaccacggc cgccctcccc tcgagacggg tacgggagtc cgcaccacac 24 0 

gccgccgtac gggccccggt ctaggccgta cgggagcagt cactctccgc gacacggcgg 300 

cagcttcccg gggggccggt tcgggtctcc gtcccctggc ggctaccctg gctcctactc 360 

caggtccccc gcggggtccc agcagcaatt cggctactcc ccaaggcagg annanaanca 42 0 

nccncanggt tntncaagga catntacacc atttggatca nggcgtntta naaaaaaaan 480 

aatgttaatg anttggaaaa ntatttnaaa gcctttnaat gnttnnnnna atccttnggg 54 0 

nttggcctta naaanccaan attntngtng gngggntntt aannccnnnc aantncnnnn 600 

nnattncntt naaaacnttt nnnccanggn cnnaaaaaaa nggggnaann aaaaaacttt 66 0 

tttnnttnaa nnantttttt tggaaaattt naaancntng gaaaancntt tnnntngttn 72 0 

ntnangggaa annantnttt tgggnncnaa aaaacntttt naannnntnn nggttnnnan 78 0 

nnnttaaaaa ntttnnnccc ccaannnnnt nnanngnanc ttttnnantt ngggantaaa 84 0 

nttnnnnnna nggggnnttt tttnngnnna atttnrmnnn annnnnnnan nnangggnnt 900 

ttngnnngna annntnn 917 

<210> 513 
<211> 780 
<212> DNA 
<213> Homo sapiens 

<400> 513 

tnnnnnnttt aaatccatta gctacttgtt ctttttgcag gatcccatcg attcgtgcgg 60 

gagcacccga gcctgcggct ccagacggac gcccgcaagg tgaggtgcat cctgacaggt 120 

cacgagctgc cctgccgcct gccggagctc caggtctaca cccgcggcaa aaagtaccag 18 0 

cggctggtcc gcgcctcccc ggccttcgac tatgcagagt tcgagccgca catcgtgccc 24 0 

agcaccaana acccgtangt ggtccncggc ggcgcgggga ggcccagggc aatnngacag 3 00 

nccctccgnt tgactccgcc agtgctgcag nccctactct ttcanagttg ggagccctgg 360 

gacccaggca ccaattgttc ttgcaaactc accctgcggc acatcaacaa gtgcccanaa 42 0 

cacgtgctga ngcacaccca aggccggcgg taccagcgag cttttgtgta aatatgaaga 480 

atgtctnaag caaggggtgg agtacatgcc tgctgcctgg tgcacccgan gangaagang 54 0 

gaaggacaaa tggacngtga acggccttcg cccgcgggaa agcttctggg agcccacatt 6 00 

caatgatgaa gggggagctg caagtgatga cagcatgaca gacctgtncc cctgactttt 66 0 

caccagaagg accttgaaca cngaggatgg ggatggactg atgatttttg acaacaaaga 72 0 

ggttgaaagg caaancccca aaaaaaaggc cttgtgaagg cagganaaan acaacctntc 78 0 

<210> 514 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 514 

tttnnnngnt ttannncatt ttgctactng ttctttttgc aggatcccat cgattcggaa 6 0 

ttatagtatt gacgtgaatc ccactgtggt atagattcca taatatgctt gaatattatg 120 

atatagccat ttaataacat tgatttcatt ctgtttaatg aatttggaaa tatgcactga 180 

aagaaatgta aaacatttag aatagctcgt gttatggaaa aaagtgcact gaatttatta 240 

nacaaactta cgaatgctta acttntttac acagcatagg tgaaatcata tttgggctat 300 

tgtatactat gaacaatttg taaatgtctt aatttgatgt aaataactct gaaacaagag 360 

aaaaggtttt taacttanag tagccctaaa atatggatgt gcttatataa tcgcttagtt 420 

ttggaactgt atctgagtaa cagaggacag ctgtttttta accctcttct gcaagtttgt 480 
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tgacctacat gggctaatat ggatactaaa aatactacat tgatctaaga agaaactagc 
cttgtggagt atatagatgc ttttcattat acacacaaaa atccctgagg gacattttga 
ggcatgaata taaaacattt ttatttcagt aactttnccc cctgtgtaaa gttactatgg 
tttgggggta caacttcatt ctatagaata ttaagtggga agtgggtgaa ttctactttt 
tatggttggg gtggaccaat ggctatcaag agtgacaaat naaggttaan ggatgattcc 
caaaaaaaaa aaa 



540 
600 
660 
720 
780 
793 



<210> 515 
<211> 770 
<212> DNA 

<213> Homo sapiens 
<400> 515 

cttattncat nnagctcttg ttctttttgc aggatcccat cgattcgaat tcggcacgag 60 

gttgtattgg aaagcagtag tgtggacgaa ttgcgagaga agcttagtga aatcagtggg 12 0 

attcctttgg atgatattga atttgctaag ggtagaggaa catttccctg tgatatttct 180 

gtccttgata ttcatcaaga tttagactgg aatcctaaag tttctaccct gaatgtctgg 240 

cctctttata tctgtgatga tggtgcggtc atatttatag ggataaaaca gaagaattaa 3 00 

tggaattgac agatgagcaa agaaatgaac tgatgaaaaa agaaagcagt cgactccaga 3 60 

agactggaca tcgtgtaaca tactcacctc gtaaagagaa agcactaaaa atatatctgg 420 

atggagcacc aaataaagat ctgactcaag actgactctg atagtgtagc attttccctg 480 

ggggagtttt ggttttaatt agatggttca ctaccactgg gtagtgccat tttggccgga 54 0 

catggttggg gtaacccagt gacaccacac tgattggact gccctacacc aatcagaact 6 00 

cagtgcccaa tgggccactg ttttgactcg gaatcatgtt gtgcactata gtcaaatgta 660 

ctgtaaagtg gaaanggatg tgccaaaaaa ttaaaaaaaa ccnccaaaaa agcttccaaa 72 0 

aaaaaacctt taaactatag tgagtcgtnt acntagatcc aacatgataa 770 

<210> 516 
<211> 825 
<212> DNA 

<213> Homo sapiens 
<400> 516 

tttccagttt tanttttttc ancttttnga tcnntttgca ggatccntct tttcgaattc 60 

ggcacgagat tctccctaaa ttgtngatcc cactgtttac naaactgttc tnttgtgctg 12 0 

gcntgctnan tgctntgtag nncctttctg nacnntaggc attgctcttg gagaacnnga 180 

tgtgctttnt ntnaaanggc anaccagngn tgnnctgnnt ttaatgatgc agancctnac 240 

tttatccaca cctggcccgt ttnacatttn agtaangnac gatatttggc tgatggctga 3 00 

acantttctg aaatacacnt ttagtgtatg gaantacaag accnntaaag gnctgccagg 360 

ttancatctc atctngcatt cnnntccttt ggcnanaaag gganatntca gaattatatt 420 

tcttgatggg gtcttttcaa tcantgtatc tgtcgaaann tcttaganaa anctatgtgn 480 

tcncggtgtt gtctaaaaan atnctttcaa anatgacccc tggaattncc tgananangc 54 0 

ttaaacgtga gaagacnggt nggcaaaaca ccctncnaag gttnttggna angcccnant 6 00 

ntgttttgtc tggcccatat aancttngcn ccattnaagc cncgggngag ctttgnatnt 660 

atattngngg ngttactttc tttgnncctt tgcggggaac ancttnnata atgcttntcn 72 0 

ncccnanntg gacntttgct ttttgnnncc nnaccccccc aaagggngcn cacctccant 780 

gaaaaagtct ttttnnaaaa gggctccttn ctnaaaaaaa nnnnt 82 5 



<210> 517 
<211> 1444 
<212> DNA 
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<213> Homo sapiens 



<400 
ctctcncnnc 
nnncnnctnn 
ccnctattnt 
tnntntactn 
actngttcca 
aattncccnn 
ctaatgntta 
agngggctna 
cggnnncccc 
ncgnnnnnnc 
tnntnnaacn 
tncttnttnn 
tatctnctac 
ncccctctnn 
atnncnccta 
nctnntcttn 
cttcctctcn 
ananntcctt 
ntnctctcac 
tcttcttaaa 
ctcgnaaatc 
ngngcanaat 
aacaggnacc 
cncctntgta 
anat 



> 517 
nnnncnnntc 
nnnnncntnn 
cntnntnntc 
tcnntnntct 
agtctcntan 
gctnttatan 
gatccngnta 
ggcanngnnn 
tgaattttnn 
gnntarmccn 
nnnttantct 
atcttntnct 
nctctnttcc 
ctcttacctn 
tcctctactt 
tnctatttnt 
ctctcctnnn 
antttcctnc 
tctaccactc 
cntctcctct 
tctccaatnc 
nnaacncnan 
nccctntntc 
anagcatctc 



tctnncnntn 
cntccntctc 
nntcntcnnn 
ggctnttnta 
cngcntctnt 
aagngngnaa 
gaanncgnta 
tntgcaaann 
ancnngganc 
canngttaan 
annatnntta 
tctcncttna 
ncttctccct 
nntnctctcn 
attctcctcc 
actntcncta 
nnnactactc 
tcatcacant 
tntnctccac 
tatcnctcta 
tnccccactc 
tccnctccct 
tcnaaatcct 
nctntccncc 



nnnnntnntn 
ttntntngct 
antnctnnnt 
tntggggggt 
ctnnctntct 
ttaaggtntc 
catgttggga 
ctcnnntntc 
nttaaatnnt 
tgcncccnna 
cnnntnagnt 
tttntatctc 
nncntctctc 
tcntatctcn 
tattntncca 
ttcctncntc 
tcaccntctc 
actcttccct 
tcatatnana 
anctcctctt 
taaaaatnnc 
ttagctatct 
catnctncta 
aatnnanatc 



nnnctcnnnn 
ctcntntctc 
tctncctnnc 
ctatttnttn 
ntgcnctncn 
nnntctanng 
acngacanct 
nnancttgnn 
ntngnggtac 
nnnantcaac 
tttcctcnct 
ritntntntnc 
atcatatccc 
nnaccctctt 
ctcacccttc 
tctnntgnct 
nnctntcnct 
ctcatnntca 
cttctatant 
cntcgctanc 
ncntcngant 
ctctanaaac 
ctttatatnt 
tccctnctcc 



cnnnatctnn 
ntncatcttn 
canctntcca 
ncttaaatcg 
ctggggcntt 
ctntgcaagg 
tnctgcncaa 
tcncgtagan 
gannccncnn 
tctntnntcc 
nacnnctctg 
tnccctnatc 
acgcnactna 
tctntntctt 
ntntntctnc 
cccaccccct 
ctacnnntnn 
nanctaantt 
nctaatccta 
tccnntncaa 
cccacttttc 
cccntttctc 
cnccaagcct 
natanatntn 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
72 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1444 



<210> 518 
<211> 706 
<212> DNA 

<213> Homo sapiens 



<400 
ctaatggctg 
ccgaagaaaa 
caagtttgcc 
aatgtattgg 
gcttcctaca 
cactctctga 
cttctcagtg 
tagtgcgaga 
tcagtgcatt 
ccaaactgaa 
agctccttga 
tcattgtggg 



> 518 
gnngctcgtt 
agactgtggt 
ttctcctatg 
aatggaacta 
gcgcctaact 
tcctgtggaa 
ggaacggact 
tgaccttgga 
tcatgctgaa 
attctggggg 
acacaatgag 
taaactgtgg 



ctttccgcaa 
ggcggagatg 
ttttccagaa 
tccaagatca 
gaatacatgg 
aggatgcagt 
ggaaaacctt 
tttagactca 
ggattaaaca 
aagagtgtag 
gcatatacat 
atcgaacagt 



cancccngcg 
ctctctccaa 
atgacttcag 
cgatgccagt 
agcatactta 
gtgtagctgc 
tcaacccact 
tctccgaaca 
atgacttcat 
aagcagaacc 
ggacaaatcc 
atggcaatgt 



antcgaattc 
tggcatcaag 
tatctggagc 
tatatttaat 
cctcatccac 
gtttgctgta 
gctgggagag 
ggtcagccat 
ctttcatggc 
caaaggaacc 
cacctgctgt 
ggaaat 



ggcacgaggt 
aaacacagaa 
atcctcagaa 
gagcctctga 
aaggccagtt 
tctgctgttg 
acttatgaat 
cacccaccaa 
tctatctatc 
atcaccttgg 
gtgcataata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
706 



<210> 519 
<211> 734 
<212> DNA 
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<213> Homo sapiens 
<400> 519 

tngtaccaat tatctgctgg ctanntagcc taaanagntt ggtcngggcg aattcggcac 60 

gagggnaaag cagnaagtaa tgagcttgtc cgtcagctgg tagctttcat tcgtnaaaga 12 0 

gataaaagag tgcaggcgca tcgaaaactt gtggaagaac agaatgcaga gaaggcgagg 18 0 

aaagccgaan agatgaggcg gcagcagaag ctaaagcagg ccaaactggt ggagcagtac 24 0 

agagaacaga gctggatgac tatggccaat ttggagaaag agctccagga gatggaggca 3 00 

cggtacgaga aggagtttgg agatggatcg gatgaaaatg aaatggaaga acatgaactc 3 60 

aaagatgagg aggatggtaa agacagtgat gaggccnagg acgctgagct ctatgatgac 420 

ctttactgtc cancatgtga caaatcnttc aagacanaaa atggccatga agaatcacga 480 

gaagtcnaan aagcatcggg aaatggtggc cttgctaaaa caacagctng angangaacg 54 0 

aagaaaattt ttcaagacct caaattgatt gaaaatccat tagatgacaa ttcttgagga 600 

agaaatgnga aagatgcacc aaaaacaana agctttctac acantnaaat ccnannaact 660 

ccatccntct anaactatnn gtgagtcctt nttacntcna tccagacatg antancnata 720 

cnattgatgg aacc 734 

<210> 520 
<211> 701 
<212> DNA 
<213> Homo sapiens 

<400> 520 

ctaatgctgg ctnttgttct ttttgcagga tcccatcgat tcgaattcgg cacgagccca 60 

catgtaccag gttgagtttg aagatggatc ccagatagca atgaagagag aggacatcta 12 0 

cactttagat gaagagttac ccaagagagt gaaagctcga ttttccacag cctctgacat 18 0 

gcgatttgaa gacacgtttt atggagcaga cattatccaa ggggagagaa agagacaaag 24 0 

agtgctgagc tccaggttta agaatgaata tgtggccgac cctgtatacc gcactttttt 300 

gaagagctct ttccagaaga agtgccagaa gagacagtag tctgcataca tcgctgcagg 360 

ccacagagca gcttgggttg gaagagagaa gatgaaggga catccttggg gctgtgccgt 42 0 

gagttttgct ggcataggtg acagggtgtg tctctgacag tggtaaatcg ggtttccaga 480 

gtttggtcac caaaaataca aaatacaccc aatgaattgg acgcagcaat ctgaaatcat 54 0 

ctctagtctt gctttcactt gtgagcagtt gtcttctatg atcccaaaga agttttctaa 600 

gtgaaaggaa atactagtga atcacccaca aggaaaagcc actgccacag aggaggcggg 660 

tccccttgtg cggcttangg ccctgtcagg aaacacacgg g 701 

<210> 521 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 521 

naacacttng ctacnngttc tttttgcagg atcccatcga ttcgaattcg gcacgaggag 6 0 

atctctggga tgtcagtgag gctggttgaa gaccagaggt aaactgcaga ggtcaccacc 12 0 

cccaccatgt cccaggtgat gtccagccca ctgctggcag gaggccatgc tgtcagcttg 180 

gcgccttgtg atgagcccag gaggaccctg cacccagcac ccagccccag cctgccaccc 24 0 

cagtgttctt actacaccac ggaaggctgg ggagcccagg ccctgatggc ccccgtgccc 3 00 

tgcatggggc cccctggccg actccagcaa gccccacagg tggaggccaa agccacctgc 360 

ttcctgccgt cccctggtga gaaggccttg gggaccccag aggaccttga ctcctacatt 420 

gacttctcac tggagagcct caatcagatg atcctggaac tggaccccac cttccagctg 480 

cttcccccag ggactggggg ctcccaggct gagctggccc agagcaccat gtcaatgaga 540 

143 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 PCT/US98/27610 



aagaaggagg aatctgaagc cttgggtaag gatttggggc acagtaccag gaggggggct 600 

tggtgccaga cctcatgagg aagaaggatt ttcctatgta cagagaaggg gacccctgtc 660 

ctgttgggan gtgctgtgca aacctaacca aagttactaa cccctctggt ttctgnggtt 72 0 

acacaaangg ggataaatac aaagctttnc ctnaactagc caattctatt tgggtttcct 



gagt 



780 
784 



<210> 522 
<211> 719 
<212> DNA 

<213> Homo sapiens 
<400> 522 

ttctaatttn aatccttnaa atnggttctt tntgcaggat cccatcgatt cgaattcggc 6 0 

acgagagaac acaggtgtcg tgaaaactac ccctaaaagc caaaatggga aaggaaaaga 12 0 

ctcatatcaa cattgtcgtc attggacacg tagattcggg caagtccacc actactggcc 18 0 

atctgatcta taaatgcggt ggcatcgaca aaagaaccat tgaaaaattt gagaaggagg 24 0 

ctgctgagat gggaaagggc tccttcaagt atgcctgggt cttggataaa ctgaaagctg 300 

agcgtgaacg tggtatcacc attgatatct ccttgtggaa atttgagacc agcaagtact 360 

atgtgactat cattgatgcc ccaggacaca gagactttat caaaaacatg attacaggga 42 0 

catctcaggc tgactgtgct gtcctgattg ttgctgctgg tgttggtgaa tttgaagctg 48 0 

gtatctccaa gaatgggcag acccgagagc atgcccttct ggcttacaca ctgggtgtga 54 0 

aacaactaat tgtcggtgtt aacaaaatgg attccactga gccaccctac agccagaaga 6 00 

gatatgagga aattgttaag gaagtcagca cttacattaa gaaaattggc tacaaccccg 660 

acacagtanc atttgtgcca atttctggtt tggaatggtg acaacatgct ggagccaat 719 

<210> 523 
<211> 710 
<212> DNA 

<213> Homo sapiens 
<400> 523 

tnnncttcaa atcgntngct cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60 

cgagagatta tgagcatgta gaagatgaaa cttttcctcc tttcccacct ccagcctctc 12 0 

cagagagaca agatggtgaa ggaactgagc ctgatgaaga gtcaggaaat ggagcacctg 18 0 

ttcctgtacc tccaaagaga acagttaaaa gaaatatacc caagctggat gctcagagat 24 0 

taatttcaga gagaggactt ccagccttaa ggcatgtatt tgataaggca aaattcaaag 3 00 

gtaaaggtca tgaggctgaa gacttgaaga tgctaatcag acacatggag cactgggcac 36 0 

ataggctatt ccctaaactg cagtttgagg attttattga cagagttgaa tacctgggaa 420 

gtaaaaagga agttcagacc tgtttaaaac gaattcgact tgatctccct attttacatg 480 

aagattttgt tagcaataat gatgaagttg cggagaataa tgaacatgat gtcacttcta 54 0 
ctgaattaga tccctttctg acaaacttat ctgaaagtga gatgtttgct tctgagttaa 
gtagaagcct aacagaagag caacaacaaa gaaattgaga gaaataaaca ctggccttgg 

aaagaaggca ggcaaagctg ctgagtaata gtcagaccct aggaaatgat 710 

<210> 524 
<211> 730 
<212> DNA 

<213> Homo sapiens 
<400> 524 

ttnnnnnttt aancnttcaa atcnctaggc tacttgttct ttttgcagga tcccatcgat 60 



600 
660 
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tcgaattcgg cacgagccca cactcggaca ctgtggaatt ctaccagcgc ctgtcgaccg 12 0 

agacactctt cttcatcttc tactatctgg agggcactaa ggcacagtat ctggcagcca 180 

aggccctaaa gaagcagtca tggcgattcc acaccaagta catgatgtgg ttccagaggc 24 0 

acgaggagcc caagaccatc actgacgagt ttgagcaggg cacctacatc tactttgact 300 

acgagaagtg gggccagcgg aagaaggaag gcttcacctt tgagtaccgc tacctggagg 360 

accgggacct ccagtgacac cggcccctnc ctctacccac ccccttcccc cgcatgctga 42 0 

tccccctgcc caggtaaggg ccctgccctg gaagactgga gggaggcccc aagccacggg 48 0 

gcatccccct ctcccaggaa gcagggaggg ggccgggagg ttttcctctc aagccccacc 54 0 

ctgggggccc gggggcgagg gctgccccct cctcccctcc ccagtgaggg acattttttg 600 

gtaaaaccta ttttcatttt ggaaaatatt tatgaataaa tagttttata tgaaaaaaat 660 

tntngnnntt nnnatnnnan aataaaancn tcgnncctct taaaactata gtgaagtcgt 72 0 

attaccttag 730 



<210> 525 
<211> 711 
<212> DNA 

<213> Homo sapiens 
<400> 525 

gcngntnttn antttcaaat cgctnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggataaa tacctcagcc cctcgccttc ctcaacccac ctggcaagtc 12 0 

ttcttaggat ctgatcccag ttttctggaa gcaatcctac cccagcccaa gcttcccaga 180 

gtcgagcctt aatccttctc acttctcagt gtcagagcag aaatgaatcc tggggttgac 24 0 

tgtgtccatt cgggttatta gcagctaaga agcccagacg agtagtgtga gctgccttgg 300 

gagcctcagt gagggcactg ggactggcct cactctcttg cccccagcct agtgggcttt 360 

ctcctctgtc tctccggtgg ccccaggcaa tcgactgcat cacgcaggga cgtgagttgg 42 0 

agcggccacg tgcctgccca ccagaggtct acgccatcat gcggggctgc tggcagcggg 480 

agccccagca acgccacagc atcaaggatg tgcacgcccg gctgcaagcc ctggcccagg 54 0 

cacctnctgt ctacctggat gtcctgggct agggggccgg ccaggggctg ggagtggtta 600 

gcccggaata ctggggcctg ccttagcatc ccccatagct tccacagccc cagggtgatc 660 

tcaaagtatc taattcacct taacatgtgg gaagggacag gtggggcttg g 711 

<210> 526 
<211> 692 
<212> DNA 
<213> Homo sapiens 

<400> 526 

tacangctac ttgttctttt tgcaggatcc catcgattcg aattcggcac gagagaacag 6 0 

ggagaagaga ggaagaggga gctgcaggtg ccagaagaga acagggcgga ctctcaggac 12 0 

gaaaagagtc aaaccttttt gggaaaatca gaggaagfcaa ctggaaagca agaagatcat 180 

ggtataaagg agaaaggggt cccagtcagc gggcaggagg cgaaagagcc agagagttgg 24 0 

gatgggggca ggctgggggc agtgggaaga gcgaggagca gggaagagga gaatgagcat 3 00 

catgggcctt caatgcccgc tctgatagcc cctgaggact ctcctcactg tgacctgttt 360 

ccaggtgcct catatctcgt gactcagatt cccgggactc agacagagtc cagggctgag 420 

gaactgtccc ccgcagctct gtctcccttg ctagagccca tcagatgctc tcaccagccc 480 

atttctctac tgggctcctt tttgactgag gagtcacctg acaaggaaaa acttctatca 540 

gtactttgat atgtcacagt ttcatgttta tccagttcaa tgtattttta aatttttcct 600 

tgagacttct ttgactgata gattattgtg aatgtgtttt taaatttcca aatgtttang 660 

gattttcata tctttcttat gctgatttcc aa 692 
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<210> 527 

<211> 769 

<212> DNA 

<213> Homo sapiens 



<400> 527 

gttctngttc tttttgcagg atccctcgat tcgaattcgg cacgaggcca agcctcggcc 60 

tccactgcac ctgctgcgga gtgggcacct ttgcctgcaa ggccttttnc ccantgncca 120 

atggtanttt aaccagggtt tttgncnntt aaggaggcct tngtggtggg tngttaatct 180 

ggccnttccn tattgaaaag ct.cctgttat tgtccacaga ccagaaggac ttgtaacctt 240 

ggtcccacag tctgacttng gcttttcaag cacccagaaa acttagaggg aatcttatag 300 

attccagaac ttaaggatac ctcaagggat agggtcacag ccaagaagtn caaaggaatc 36 0 

ttcagtctgg aacaaaaaca gaaccctttc atgattgaca aangtcactt tctgtttgcc 420 

tggaccaagc tactncagat catctgacca actcttaaaa atcacggcca ggcacagtgg 4 80 

ctcatgcctg taatcccagc actttgggaa gcaaaagtgg caggatcatt ncagcccaag 540 

agttcaagac cagcctgggc aacacagtga gtgagaccct gctctattta agaaaaatna 600 

ttaagaaatt tattaaaaaa gaagaatcag gaaaccaagt ncaacccaac ttaacctcaa 660 

tgaaccagcc cctaacacag atgangggat ttgggactga taagctctgt gctgngtcca 720 

tggcccgtca nttatcaagg ttgcactttt aaatgnggta tttttatgn 769 



<210> 528 

<211> 757 

<212> DNA 

<213> Homo sapiens 



<400> 528 

tnaatatcag ctcttgttct ttttgcagga tccctcgatt cgcangaggg tgttcgactg 60 

ctngagccna gcgaancgat gcctaaatca anggaacttg nttcttcaag ctcttctggc 120 

ngngattctg acagtgaggt tgacananag ntaancagga aaaacaagtn gctccagaaa 180 

ancctgtaca gaaacataag acaggtgana cttcgagagc cctgtcatct tctaaacaga 240 

gcagcatcng cagagatnat nacatgtntc atattgggaa aatgaggcac gttantgttc 3 00 

gcnattttaa aggcaaagtg ctaattgata ttanagaata ttgnatggat cctgaaggtg 360 

aaatgaaacc aggaagaaaa ggtatttctt taaatccana acantggagc cagctgaang 420 

aacagattct gacattgatg atgcagtaag aaactgtgaa attcgagcca tataaataaa 480 

acctgtactg tctagttgnt ntaatctgtc tttttacatt ggcttttgtt nnctnaatgt 54 0 

tctccangct attgtatgtt tggattgcag angaatttgn angatgaata cttnntttta 6 00 

atgngcatta ttaaaaatat tgagtgaagc tnatngtcaa ctttattaag gattactttg 660 

ctgccaccac ctagtgtcaa ataaaatcaa gtaatacaat cttaataaac ntttaaacta 720 

taaaaactcg acccttagac ctatantnag tcggttn 757 



<210> 529 

<211> 821 

<212> DNA 

<213> Homo sapiens 



<400> 529 

tnannnannc annnnnnnnn nnnnntttga agccattgct acttgttctt tttgcaggat 6 0 

cccatcgatt cgaattcggc acgagagcaa ttccactcct agctccaccc acaggaaatt 120 

gaaagcaaag acgcaaacag atgcctgtgc accaaagttc acgggcaagc atccttcggc 180 

cttaatgggc agcattccgt cgtcacaagc gggcattcat cctttcatca atagcgggca 240 

gcattccgtc gtcacaagcg ggcagcattc ctttcgccac aagcgggcag catcttgtcc 300 
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gtcacaagcg 
atcctttgcc 
aaacatggcc 
gaatgagacc 
ctaagtgaaa 
gaatgggcac 
ttgggagcga 
ttggnccgtt 
ctttaaatgg 



ggcagcatcc ttcgccaaag cgggcaagca tccttcgtca tagcggcagc 
atagcgggca aggtggaaac cctgtccatc cactgaggcg tgcatagact 
agtccaggca ctggaatcca ggcccgtaga acggcgccca cggtcaaaag 
ctgatgcact gggcgacaca gacgggcgac acagacttgg agacatcatg 
agccaggcac acggagcgga cggcgtgatc ctgctcacgt gatgtgtccc 
gttcagaggg aagaagggag atggcgcttg ccggtgcccg gggacngggg 
cggttgctgg tttggggttt ctttctgggg tgangaantg gttttgatat 
ggtgatgttt gcatacctct gaatatgctt aaganccaca gaattgacca 
atgaattgna tggtattggg aattacccaa n 



360 
420 
480 
540 
600 
660 
720 
780 
821 



<210> 530 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 530 

gnntttnnnn nnnnnnnttt tatnnntaca gctacttgtt ctttttgcag gatcccatcg 6 0 

attcgaattc ggcacgagac taccccggct acggttcccc catgcctggc agcttggcca 12 0 

tgggcccggt cacgaacaaa acgggcctgg acgcctcgcc cttgcccgca gatacctcct 180 

actaccangg ggtgtactcc ggcccattat gaactccttt aagaaagacg acggcttcag 24 0 

cccggtaact ctggcacccc ggatcgagga caagtgagag agcaagtggg ggtcgagact 3 00 

ttggggagac ggtgttgcag agacgcaagg gagaagaaat ccataacacc cccaccccaa 360 

cacccccaag acagcagtct tcttacccgc tgcagcccgt ccgtccaaac agagggccac 42 0 

acagataccc cacgttctat ataaggagga aaacgggaaa gaatataaag ttaaaaaaaa 480 

gcctccggtt tccactactg tgtagactcc tgcttcttca agcacctgca gattctgatt 540 

ttttggtggt gtgtctcctn cattgctgtt gttgcaggga agtcttactt aaaaaaaaaa 6 00 

aaattttgtg agtgactcgg tgtaaaacca tgtagtttaa cagaaccaga nggttgacta 660 

ttgttaaaaa caggaaaaaa ataatgtaag gtctgttgta aatgaccaan aaaaaaaaaa 72 0 

aaactcngcc tntaaactnt tntgagtcgt nttcgtaaat ccaan 765 

<210> 531 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 531 

gnntttnnnn nnnnnnnttt taagntactg ctacttgttc tttttgcagg atcccatcga 6 0 

ttcgaattcg gcacgaggtt cttcaaagcc aaccaagaca ggcttagcag ttttagagct 120 

tcagaacaaa ttgccaaaag ccagagttgt ttatgctagt gcaactgggt gcttctgaac 18 0 

cacgcaacat ggcctatatg aaccgcttgg catatggggt gaggggtact ccatttagag 24 0 

aattcaagtg attttattca agcagtagaa cggagaggag ttggtgccat ggaaatagtt 3 00 

gctatggata tgaagcttag aggaatgtac attgctcgac aactgagctt tactggagtg 360 

accttcaaan ttgaggaagt tcttctttct cagagctacg ttaaaatgta taacaaagct 420 

gtcaagctgt nggtcattgn cagagagccg gntcagcaag ctgcagatct gattgatgct 480 

gancaacgaa tgaagaagtn catgtggggt cagttctggc tgtcaccaga ggttcttcaa 54 0 

atacttatgc atagcatcca aagttaaaag ggttgtgcac tagctcgaga ggaaatcang 600 

aatggaaaat gtgtngtaat tggctgcagt ctcaggagaa gctnnaacat tagaactttn 660 

gaagaaggcn ggggagaatt gatganttgg ttcaactgcc aaagtgtgtg cantcactca 72 0 

ttggaaaaca tttnctgctc cagcngggaa aacttatggt tacttggn 768 



<210> 532 
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<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 532 

cgtnttttnn nnccnannga aagcccttgg ctacttgntc tttttgcagg atcccatcga 60 

ttcgaattcg gcacgaggat cagcccacct cggcctcaca aagtgntggg attacaggcg 120 

tgagccacct tgcccaccca catcatacag ttgaaatgaa actttgccac aaccagcctt 180 

tgctgtacac acacatatat cactgaacct ggttgaaata aagntttttt tctttttcct 240 

ctggtattct gggttctgaa gtctggtatt ctggtattct gggttcaaaa gtatgacttg 3 00 

agagtgttgc tctggtattc tgagagttgc tctgtattct gggttctgaa gattatttga 360 

aaaataactc ctactacatt gaaatgcaga cttaaaaatt taaacattgg attaggcagt 420 

caaaaaaacc aagcaagcat aaaaggtcaa taagttgtaa tcttgatagt aaaggtggaa 480 

aacttattat aaatggaaag aaagtttatt tccttttttg gttgatgggc agtatgccat 540 

attataccca aagttctttt aaaaaatatt tccatcacca tttttattta aaataaacat 600 

ttgagggaag taccaaggca gcttttttcc tcaaaagtac ctggtcctct ttgggaatag 660 

cacattttan gggcattggg taatcctgag attttactca ntaaatcctg atggtactgg 720 

gtgtaaaata tctttagtng gattgaaggc cttgnggggg a 761 



<210> 533 
<211> 735 
<212> DNA 
<213> Homo sapiens 



<400> 533 

taaacatcng gctacttgtt ctttttgcag ggatcccatc gattcgaatt cggcacgaga 60 

cactgtccca ctccatcacc caggctggag tccagtggtg tgatcatagc tcgctgcatc 120 

ctccagttcc tgggttcaag ccatccctcc tgcctcagcc tccccagtag ctggaactac 180 

aggtgtgtgc catcacacct ggctttacat ttttctgtgg ggtcttacta tgttgcccag 240 

gccggtctca aactcctgag ctcaagtgat cctctgcctc agcctccaga gtatctggga 300 

ttacatatgt cggctaccgt gtctggccgt tcacatcttt ggccactatt tgcttgtgaa 360 

aaggtataat gaggtggtac ttatcatttt tactgngtct catgttttgt atatttttgt 420 

ttcatcaact aagatgcact gtaacatctc tgaaatctgg atatattatc aatggtttat 480 

catagttttg ttagcaatac actgtctttt agtggtgcct aaaataatgg tatagttgtg 540 

aggtgatctt agatttgatg aagcacagta tgcaggtagg cctaatgggg gaagatggta 600 

atataaaagc aagaagtatt ttttttttgt aatgactgaa agctgtctgt ggatgaccta 660 

ccctttnctt taaacacgat tntntcactt ncaactncaa acttgctcaa ctaatncttt 720 

aaaaataact tgagc 735 



<210> 534 
<211> 735 
<212> DNA 
<213> Homo sapiens 



<400> 534 

natngnttgc tcctngttct ttttgcagga tcccatcgat tcgagacaac ccagaaacaa 60 

attcatacat ctatggtgac cacttttgac aaaggaatga agaacataca ctgggggaaa 120 

agataatgtc tttaataaat ggtgctggga aaactggntn tccantntgc agaagaatga 180 

aactagaccc ccatctctta gcatatacaa aaatcaaaat taattaaaaa gttaaatcta 240 

agacctcaaa ctatgaaaca gctaaaagaa aacatcgggg aatctctcca ggacattgga 3 00 

gtgggcaaag atttcttgtg taatacctga caaacaggca accaaagcaa aagtggacaa 360 
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atgggatcac atcaagttaa aaatcttctg cattgcaaag gaaataacaa agtgaagaga 42 0 

cacccataga atgtgagata atatttgcaa actatccatc tgtattaggc catttttgaa 480 

gtctacaaag aaatacttga gactgagtaa tttataaaga agaggtttaa ttggctcacg 54 0 

gttttgcagg ctgtcaggaa gcatggtgct aacatctgat cagcttgtag ggaggcatca 600 

ggaagtttcc acccatggtg gangcaaaag gggaataagt ttctccatgg caggtgcagg 66 0 

gcaaaaanan gggggaaggg aagtgccnca caaccagatc ttgtgagtnc tcagatttgn 72 0 

ggngggngct tgngg 735 

<210> 535 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 535 

tnaannanag ctacttgttc tttttgcagg atcccatcga ttcgaattcg gcacgaggtc 6 0 

catacatgga gctccctgga cccgtgtgct ctcgtgtgac tgaacgtttt gtgatgaaag 12 0 

gaggagaggc tgtctgcctt tatgaggagc cagtgtctga attgctgagg agatgtggga 18 0 

attgcacacg ggaaagctgt gtggtttcct tttacctttc agctgaccat gaactcctga 240 

gcccgaccaa ctaccacttc ctgtcctcac cgaaggaggc cgtggggctc tgcaaggcgc 3 00 

agatcactgc catcatctct cagcaaggtg acatatttgt ttttgacctg gagacctcag 360 

ctgtcgctcc ctttgtttgg ttggatgtag gaagcatccc agggagattt agtgacaatg 42 0 

gtttcctcat gactgagaag acacgaacta tattatttta cccttgggag cccaccagca 480 

agaatgagtt ggagcaatct tttcatgtga cctccttaac agatatttac tgaaggaatc 54 0 

taggttgtat tttcagtgga caatgggaat aaagcatttc taaagcaccg actggagagg 600 

aaggcaacag aaacaaggag agaagcccga gagacatgtc tgcgtgctgc cacgcatctg 660 

ancgattgct cttgtgaaga gtttgtcact gaacattttc aggggaggct gtttacccag 720 

cnatgtnctn aacan 735 

<210> 536 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 536 

gccccccnnn nnnnnnnttt tcaaannccn ttnnnnnnnn nngnnnnttt tannnnnttn 6 0 

ttannnnaca gctcttgttc tttttgcagg atccctcgat tcgattcggc acgagctacc 120 

ttgggctggc cctctatnat gctntgaggg gagctgggac agatgatcnt nccctcntca 18 0 

gngtcatggn tnccangngt gagnttnatc tgccnnacat ngtgacggag tttaggaaga 24 0 

atgntgccnc ctctntttat tccatgatta aggganatcc atnnggggac tataagaaaa 3 00 

gcnnttttnc tgctntgngg ncaanangan tnacnngncc cgggnnanag ctcctatgct 360 

gtntgcctgc accaccccct gccttccttc atacctttcc ntggatatgn atgccagggc 42 0 

ttnncacatt gcctnattna tactnacntg ctnatgacca anacatncac gtgataacac 480 

aaacantggg tgcttgnttc tgatcnctag aggnganctn ttggnnngnt ggagnactna 54 0 

antnttctna gtgtnacttn agttcaatgc ctggccatnt gcnatnacct tatatcntnc 600 

aaagaggcta ctgtgctttt ancctttttt aaaacctcca tctgtattac attgnnaacc 660 

angtttcttt aatnaggagc ttgacctcta nantgggaac tcttgggaat ggncttagtg 72 0 
aagttcgcna ctaacttaac ctgaaaatta tnatgnnctg tttnacctat catgttnata 
actnt 



780 
785 



<210> 537 
<211> 967 
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<212> DNA 

<213> Homo sapiens 



<400> 537 

agtanggcgn ttcctaatnn annnggctaa gcgactttna aagangaggc tngcgtgntg 6 0 

aataccgnnc gaggggggat nacaatagta nacnnggtnc caatncatgc ttaacaccgc 12 0 

atntctttac cccccnannn ncacanatgc agacncacac atngcanncg nacacncaga 18 0 

cacacacang caagcactnn catgcatggc ccatgctcac acacntgnan nnaacatgcn 240 

gtagacatnt nagacacgtc atgtnacaca tgnnacacan gnnnaanaca ctgctttnca 3 00 

ngcanacnca gacggcacnn ngagacanac atgcnnaaac aacatgctcn ctcacntnna 360 

nncgntgggc cngtagtagt gtactgtggg tgnnactggg tgccatcnac nnngtatttt 420 

acgnnctttn aactaaaaan cttggagcct tnanttnntn tggtgantnc aatncctana 480 

antnncttga gngggatgaa ccctaananc ctggccctnn tnccnctttc aaggccnagn 54 0 

aattganatt attncntant ngnncacgaa gcttntggta ncangngncc cgagnnctnt 6 00 

tnaaanttnn ctnttttnan aatnaaacat tttancggtt ctnaggancc gngcctncng 660 

ggtanggann naattgtncc tgggnatagt tctcacaant natnttnaag gggnnaagng 72 0 

atnngngngg nccntntatg nggcnngcca annaangggg tcgrmgttaa natattccaa 78 0 

gntaacanan gnacnatggn accnatccct ntnngaagna aggaactncc tgnncgacta 84 0 

nnnactatgn naaatattct cacatntaca naaaaagnag gnnccnnggt ncttnaagnt 900 

tntgcatagn nactatncnt gggacgngtt aacnnanatt ntatgcttta nnngatnggg 96 0 

gcttnnn 96 7 



<210> 538 
<211> 892 
<212> DNA 
<213> Homo sapiens 



<400> 538 

gctagttnga agaggtgttt ctaangnntn ggaatcgaca tctnnnnagg cngnccntgc 6 0 

gattcgcttt gctctctcca ttccaagttg ttctctgttc tagaaagcng atgnngggnt 120 

acatctactg tttttgccta aacagaatcc ctttntcctt tttttgttaa aaggctcatn 180 

cctaatatta cattgctctg gaacgantga caataccana actcagcacc ntgatcggac 24 0 

cgggacaatc agattatcta attcctcagc aaacggagat cgatccgaaa agtggaaata 300 

tganctcntn ctttgtgntg gcatatggac cctgagagaa agaaacttta atcttttact 36 0 

cttggactgc aatnaagtnt agctgcctaa aaatcnnttt cntgacactt ngnaggtttg 42 0 

tccacaatcg ggngaaatta nngggtnnga cntaancact ggatgaaaaa aaatnccgnt 48 0 

tanttntatt ncnnttccan ncttntnaaa tanananttt ntcanccttn nntaatacta 54 0 

ttanntatat ntnttnnncc cnnatnnncc ttcttnctcc tacnncnntn cnatntnnnn 6 00 

nnangntcnn cnannnnttc tnttatttct annatatntc ntancnttna ctaaaacctc 660 

cnctcgtnna nattncnnta taatattntc tctaganntt ntnntntntt gnnncttaaa 72 0 

anctcntcta tccctantat nantnattct taccatnaaa tacactanaa gtnntntcac 78 0 

gagacncgnt atgttantnc anactataat cgcttncatn tanntatatn taaaantgct 84 0 

atncagnnag nngntnttat atntttanct ngnnaggnta tcctcnatan cc 892 



<210> 539 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 539 

gnnnaggttn tagancagct cttgttcntt gngcaggatc cctcgattcg aattcggcac 6 0 
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gagagtgtca gttttcctaa tctcagtcca ggtaggaatt aagaaatatc tcaagtgttg 120 

atgctatcca agcatgttgg ggtggaaggg aattggtgcc cagaaaatgg gactggagtg 180 

aggaatatct tttcttttga gagtaccccc agtttatttc tactgtgctt tattgctact 240 

gttctttatt gtgaatgttg taacatttta aaaatgtttt gccatagctt tttaggactt 300 

ggtgttaaag gagccagtgg tctctctggg tgggtactat aatgagttat tgtgacccac 360 

agctgtgtgg gaccacatca cttgttaata acacaacctt taaagtaacc catcttccag 420 

gggggttcct tcatgttgcc actccttttt aaggacaaac tcaggcaagg agcatgtttt 480 

tttgntattt acaaaatcta gcagactgtg ggtatccata ttttaattgt cgggtgacac 540 

atgttcttgg taactaaact caaatatgtc ttttctcata tatgttgctg atggttttaa 600 

taaatgtcaa agttctcctg ttaaaaaaaa aaaaaaaaaa actcgancct ntanactata 66 0 

gtgagtccnt attacgtaga tccagacatg atnagatcat tgatgaattt ggaccaaccc 72 0 

aactagaatg cagtgaaaaa aatgcttttn t 751 

<210> 540 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 540 

gntnggntcn agancagcta cttgttcttt tgcaggatcc ctcgattcga attcggcacg 6 0 

agcctgcagc cactaatgca ttgtgtatga taacaaaaac tctggtatga cacattttct 12 0 

gtgatcattg ttaattagtg acatagtaac atctgtagca gctggttagt aaacctcatg 18 0 

tgggggtggg gtgggggtgt attccttggg ggatggtttg ggccgaatgg ggagtggaat 24 0 

atttgacatt tttcctgttt taaattctag gatagatttt aacatccttt gcggtcccag 300 

tccaaggtag gctggtgtca tagtcttctc actcctaatc catgaccact gtttttttcc 360 

tatttatatc accaggtagc ccactgagtt aatatttaag ttgtcaatag ataagtgtcc 420 

ctgttttgtg gcataatata actgaatttc atgagaagat ttattccacc aggggtattt 480 

cagctttgaa accaaatctg tgtatctaat actaaccaat ctgttggatg tgggttttaa 54 0 

aaaatgtttg ctaactaccc aagtnagatt tactggatta aatggccctt cgggtctgaa 600 

aaagcttttt taacttcttn gcttaaaatg ccgtttaatt ttgataagat ncttnaaatn 66 0 

gcctccaaaa gtgttananc caatcatttn aaataaaccn ggntgtatat tgcatnatgt 72 0 

gtacatgcnt atncccttct ggttaaaact naaaaaaaaa t 761 

<210> 541 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 541 

ggtttanttt aaatccntnc ncagctactt gttctttttg caggatccca tcgattcgaa 60 

ttcggcacga gcggacccat cggagcgtaa cctggatctc cgcaggcctg gcggaggccg 12 0 

gccacctgga ggggcattgc ttggttcgcg tggtancaga ggagcttgag aatgttcgca 180 

tcttaccaca tacagttctt tacatggctg attcagaaac tttcattagt ctggaagagt 24 0 

gtcgtggcca taagagagca aggaaaagaa ctagtatgga aacagcactt gcccttgaga 3 00 

agctattccc caaacaatgc caagtccttg ggattgtgac cccaggaatt gtagtgactc 360 

caatgggatc angtagcaat cgacctcagg aaatagaaat tggagaatct ggttttgctt 420 

tattattccc ttcaaattga aggaataaaa atncaaccct ttcattttat taaggatcca 480 

aagaatttaa cattagaaag acatnaactt actgaagtag gtcttttaga taccctgaac 54 0 

ttcgtgtggt cttgnctttg gttataattg ctgtaaggtg ggagccagta attatctgca 600 

gcaagtagtc acncttttca gtgatatgaa tatcatcttt ggcttggang ccantngaca 66 0 

acctgncatt actgactttt tgaaaanaac cctctggata ttgatgcctc gggtgtggtt 720 
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ggactgncat ttagtggacc ccgaatcc 74 8 

<210> 542 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 542 

gtnnnnntng tgtaatcgct tggctgcagg atccctcgat ggcgaattcg gcacgaggtg 6 0 

ttgctcaang agcagacccg actccntaag gtcatcattg aatgggcatn atangtttga 12 0 

anactgtcca ananantang ngtcaataca tcaacnnctt tanntgcttg atattgnnat 18 0 

tgaanaacac angnctcngn ctagttcgcc tganatgatg tttaagatac tccggaagga 24 0 

gacanantgt tntgantgcg gattaganac cacngaagnn acactnaagg ancancatct 300 

ccacctngna actgnattnn cngaccanaa aagngaactg gaccaaatgc tctcaaaggt 36 0 

gctggcagct taanagcgtg ttangactct gcacgaagan gacaggtnnt ntgagagcct 42 0 

ggnnannaca ctctcccaaa ctaaactgna nctttcaaca nangggancc ccannttggt 48 0 

ggagaaatca ggtganctgt tggcccttcc acaaagangc aaattctntg agggcnagac 54 0 

ttnanccttt ttgcngaacc agtncttgac tgactaaatg aaagcttttt aagccaggtg 600 

gcccancctt aangaagcna ctttttaatc cancggaacc ngcttgagan aaaaccnttt 66 0 

ttgacccaaa accnggagaa ccagctggcc taccaaaggg aaatgggccc ccatttgaac 72 0 

ttggggttnc ccangaacaa nccttgnccg ggncaaagcc cnttgttgga aaggacctca 780 

acct 784 

<210> 543 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<400> 543 

ntantaaatc ccttgctctt gttctttntg caggatccca tcgattcgaa tncggcacga 6 0 

ggacccggcg gcgcggacag gcttgctgct tcctcctcct nngactcacc attncaganc 120 

agaanntgaa aaaatggnng anctcaccca ggtaanggat gatgaagtnt tnatggctnn 180 

tgcatactat gcannanttn tncttntgna aatgatgcnt atgagtactg taanngnntt 240 

ctatncattg ncaagaangg ntnttgncaa tncatangac tgtgtagcat tcggcanagg 300 

agaaaatgnc aagaactatc ttcgaacaga tgacanagtg taacgggtac gcagagncca 36 0 

cctgaatgac cttgaaaata tnattccatt ncttgnaatt ggcatnctgt attccttgag 420 

tggtcccgac ccctctacag cnntcctgta ctttagacta tntgtcggag cncggntcta 480 

ccacaccatg tgcatatttg acaccccttt cnnatccaaa tatagctatg actttttttn 540 

gtaggatatg gannactctt tccatggctt acacgntgcn gtaaagtaaa ttggccctgt 600 

gcagaaaaac attccactca gtnttccaan tggcttntta aggaattctn gaccttgcaa 66 0 

ttnatantgg agnnctttcc ttaagattta aaggtttgan ggngagccnn aggaattntn 72 0 

aaccnggggt aaaccctttt tggaattttn agcnttgnca anaa 764 

<210> 544 
<211> 755 
<212> DNA 

<213> Homo sapiens 
<400> 544 

gatgctggnt ncnnatgctt gnngatccct cgattcgaat tcggcacgag gaaatgtgta 60 

tttcagtgac aatttcgtgg tctttttaga ggtatattcc aaaatttcct tgtattttta 120 
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ggttatgcaa 
atacaggata 
caaagtttga 
ggaaacattt 
tgagagatcc 
attttacatc 
tgaaggtgaa 
aaattcaaag 
atatgtcnta 
gctgnaatgc 
aactttcata 



ctaataaaaa ctaccttaca ttaattaatt acagttttct acacatggta 
tgctactgat ttaggaagtt tttaagttca tggtattctc ttgattccaa 
ttttctcttg tattacattt tttatttttc aaattggatg ataatttctt 
tttatgtttt agtaaacagt atttttttgn tgtttcaaac tgaagtttac 
atcaaattga acaatctgtt gtaatttaaa attttggcca cttttttcag 
attcttgctg aacttcaact tgaaattgtn ttttnttttc tttttggatg 
cattcctgat ttttgctgat gtgaaaaagc cttggtattt tacattttga 
aagcttaata taaaaggttg cattctctca ggaaaaagcc atcttcttgn 
aatgtatttt tgncctcata taccggaaag ttcttaattg gattttacca 
tttganggtt ttaaaaataa taacattttt aataattttt taaaaggaca 
atnatcccgg ngntcctttn ccnnn 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
755 



<210> 545 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 545 

agnttttnaa tcctttggcc antcgcnctt tntgcangat cccatcgatt cgaattcggc 60 

acgagaaaaa gtnaagcttt tcatgagcac anntnccttg cattgttnga tgttactgat 120 

attcgtaaaa tgaatatttt ctgttttgtt ctgttnnatt tttttgagac aagtcttgct 18 0 

ttgttgccca ggctggagtg caatggcatg atcttggctc actgnaaccc ctgccttgcg 240 

agttcaagtg attcttctgc ctnagnctcc tgagtagctg ggattacagg cgctcaccac 300 

cacacccagc taatttctgt cttttnagtn gacacagggt tttaccatgn tggccaggct 360 

ggtctcaaac tnctgacctg aaactnctca caccngtnat ctcagcactt tgggaggctg 420 

angtggaaag gatcacttga agccatgagt ttgagaccag cctgngcnac acagcngaga 480 

ccccngtgnt gtacaaaagc ttncnacatt tanctggctg aggagtnnct cacccntaac 540 

ttccancnan tcnnttaagc nnanncatnt tgaacacntg agcccannta nggtcgatgc 600 

tnntagtnaa ccgtgactgg accacttaca gtccaagccc gggtngcctt ataaaagaan 660 

cggaaaacat ttcnttaatt cgggttnnag cnttanctat ttcggaatnc cttgngtttt 720 

naaaaacttg aatctccaan aaacagggtt ttttcttttg gnccann 767 

<210> 546 
<211> 989 
<212> DNA 
<213> Homo sapiens 

<400> 546 

tncccttgtt gaaanccctt tgctcctttn tnctnccgtt tgncatncna ttcgctcagc 60 

tgaggcaatt aaactggaaa agaaatagat tgaaaagata ctntngaaga agcagtacag 120 

aagttggggg actgaaggag agggagccac tgcaggtgct agctgcttaa ggggatacca 180 

gtccttttac agatataata gatacagctt ctgaggtgga gggtgatagg agtgtgtatg 24 0 

agaaanttgc agnttnacaa ctgctcntgc ctcctnggca anaggannan cntttcnccn 3 00 

nttncnnccc ttatngnaca cacattgncc tgattggncn tnccncngct agcttncagt 360 

cttnantnta ctcannagnn nntnggggaa cncnctntcn nantatgntc ccttttcctc 420 

tnncntnncc nnatancacc ccnctcnctt tcctttctaa acttncacan ntccctgana 480 

atgncttccg aatggantct tngaatttct ncgcccctnc ntcntcataa tcnttttgct 540 

nctccngctc nccctcattt tnctacgtnc cnccttctnn ttnactgnct ttaaatntta 6 00 

ttancnncnt ntncnttncn atctncaant tttcnnnccn acnnnntttt nctnntnnca 660 

aatcgcgnna aataagtntt gcncactcnn ntnctancnt attntccctc gcnnttntcn 72 0 

tcatctcccg cnncactcac ntnnncnnnt caattnntnn nnacncncnc tgctctacnn 780 
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ncnatntctn tncctncaca ccctntancn tntcnctcan aatgcctttt ctnccttann 840 

nctntcnttc ncnnatctan ccaantttnc tttnacatcc cctncnnntc tnncccgacn 900 

atatntnacc tcttnnatcn cagngcntan natcnccccn ttntcnctnt cnctctcann 960 

cttntnttna tcttcatnna tcanncncc 989 

<210> 547 
<211> 781 
<212> DNA 
<2 13 > Homo sapiens 

<400> 547 

tgtnnctttn cnncctcnnc cgaaatcnct ttgnttctaa ctttcctaat tacctgggct 6 0 

acttgcacta tcccntcgat ncgcatagat ggccnngtta ctaanggtga ntttccagcg 12 0 

cggggggcac gtggagtcac tggaacattt gngcaatgct ggtgggaatg tcaacccgng 180 

cnggcctctg gaatangcct ggcnnntcct gcnagagtta ccntgtgacc cagcaattcc 240 

actcctagct ccacccacag gantngaaag cnaagacgca nacagatgcc tgngcnccaa 3 00 

anttcacggc agcatcctnc gccatantgg cancatccgt cgtnacagcg gcatcatcct 360 

tcatcattac ggcancatcc gtcgtaacag cggctacatc acttcgccac agnggcagca 42 0 

tctgtngtca cagnggcngc anccttngcc aaagcggcag cntccttcgt catagcggna 480 

ncatnctttg ccatancngc naggtggaaa ccctgnccat ccactgaggc ntncatanac 54 0 

tanncatggn cagtccaggg cactggaanc cangccgtng aacggcgccn acggtnanna 6 00 

ggaatganac cntgatgcnc tggggccana catactggct anacanactt ggagacatca 66 0 

tgcttanttg nannnccant cacacttgcn nncggcgtna tcctgctcac gtgatncgac 72 0 

ccgaatgggc acttcaaatg ggaanaaggg ngatggcact nccggtnncc tnganagggg 78 0 

n ~ 781 

<210> 548 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 548 

tctaaacgct tggnncttgc tctttctnca ngnanccnnt gcgntncgaa ttcggcacga 60 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 120 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 180 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 240 
aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 
gaaatggttg ttcatgtcat ttgctcattt ttatattaga ttgtttttct tttttccagc 

tgacttgtag gaactctaca tcttatcaat attaatcatt tatcgaaaac tatttgggtg 420 

ccattatctt ctcctagtca atgttttttg tttgtgatat cttttataat atataagttt 480 

ttaatgttgg cagaagtaaa gttaatcttt ttggctgtgt tgtgtgtctt gtttgatgta 54 0 
aagatagttt ctgtaatagt tttgcagttt gattgntcat ctttaggtct tcaattcaac 
ctgcacatcc atcccctcta tcctctttct tactctgttt ttctccatac cacttatcat 

ccaataatat ggtcatgccc tttattnacc ngntttgcat atataatttg gcttgtnccc 720 

ggttccttcc ctana 735 

<210> 549 
<211> 812 
<212> DNA 

<213> Homo sapiens 



300 
360 



600 
660 
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<400> 549 

ttctaatact tggctctngt tctttcngca ggatcccatc gattcgaatt cggcacgagg 6 0 

ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggaagga gcgggcgtga 12 0 

ggccagctga ggcatggcga cccctgggaa ggancgggcg tgaggccagc ttgaggcatg 18 0 

gtgacccctg ggaaggancg gncgtgaggc cagctgaggc atggtgaccc ctgggtacgg 24 0 

gggacttggg ggccgacctt ggtttgccca gggcccctnc tgcaccacgg ccacatgcgg 300 

aggacggcgt tgggatangc tccctgggtc cacagcttct gcccgtgtat tggggaaccc 360 

tncttggtca aggcttcang ctcttggcag atggggcaag gaaccctgag gcttccgcgc 420 

ccttccatgg nctctgatgt gggacacttg aacgangcac gattctgaag gactccatgg 480 

atcttgggan gattangccc accttcngtt ggtggncnaa agccgtcctt ncggggcccg 540 

gcttgtttaa cnggacaact tttcnggtcg ggcttgttgg gccccaatcn ttgggttggg 600 

naanttcncc ttaaaccttg ggcccgncct tttaaccttt tttcccaatc ttttgacctt 660 

tttccaaaaa ggggtncccc tgggcttttt ngggncaatt ggttccgggg gccaaaggtt 72 0 

gggaaaaaat gccttncatt gggnaaaacc ctggatccct tgttaancct ttgggagntt 780 

aaaatggaat gaattttccc cccgggcttt tt 812 

<210> 550 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 550 

ggnnantcna tgctggtctt gtctctntct aaaagttggc nattcgaatt cggcacgagg 60 

ttctgtggct ggcatggtct gcctgctact ggagagatct cctgagantt cagttttgga 120 

ttggtgctgt catcttcctg ggaatgcttg anaaagctgt cttctntgcg gaatttcaga 180 

ntntccgntc caaaggagaa tntgtccagg gtgctttgat ccttgcaaag ctgctttcan 24 0 

cagtgaaacg ctnactggct cgaaccctgg catcatagtc agtctgggat atggcatcgt 300 

caagccacgc cttggagtca ctcttcataa ggttgtagta ncaggagccc tctatctttt 360 

gtnctctgca tggaaggggt cctcagagta ctgggtattt tncttatccc ttgactctga 420 

tagtaaacct ggccctntca gcagtttgac gcctgggtat ttatggatat taattagcct 480 

gactcaaaca atgaagcttt taaaacttcg gaggaacatt gtaaaactct ctttgtatcg 54 0 

gcatttcacc aacacgctta tttggcagtg gcagcatcca ttgggttaat catctggaca 600 

acccatgaag tcaanaatag tgacatgtca ntcggactgg ccggnagctn ttgggtagac 66 0 

catgccatnt ggcgccttgc tggtcttcca tgancccctt tggcaatcat gggtcttntg 72 0 

gcgaaccatt ttgcaaacaa ct 742 

<210> 551 
<211> 736 
<212> DNA 
<213> Homo sapiens 

<400> 551 

agtctaatgc tggtcttgtc ttttctaatg ctnggcgatt cgtcctggtg tcaaacacta 60 

taaacctttg accagctgag ctgtgactgg ctgtcacntn tctgagtcct gtgtgcacag 12 0 

tantntcctg ggtcaggtaa aatccaggtn ttcaagtttt aaggnttttt tgaanaattc 18 0 

gggcttnttt aanacgatcc ntgcccaant ccacaagctt gttgacagtg gnttacagtt 24 0 

ngngtggcaa agtccaagtt gttacactgn gctttaaaaa aaatcttatc tgcatgtatt 300 

gttaacttag agaccatgag atctatttat caggaccagg aagatncaca cttcaggtcc 360 

attgcaactg acttttttct tgtttttctt aaaaccctgg tggagcctgg gaagggggcc 42 0 

tccacaattc tgtggctttg atattagccc caattttaca agcacataca agccccataa 480 

ttgccgcagg aaaacacaag atggaaaatg caataaccca tgcactgaga cttagaaaat 54 0 
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catccttact aggcaaaatg tattatgatg caataagtgc cactgggnat tttnacgttg 600 

ggactggnca ggaactgctg caaagaaaaa taacagctcc ttctccatta tttacattta 66 0 

agatgttggt ggggggaagg ttgggagaaa ttagttctga gggtatcata tgcctttttt 72 0 

aaagaaaatg ggaata 73 6 



<210> 552 

<211> 733 

<212> DNA 

<213> Homo sapiens 



<400> 552 

nagtttaann gtatgtcttg tcttttccaa gatcctatcc gattcgaatt cggcacgaga 6 0 

agtgtcagtt ttcctaatct cagtccaggt aggatttaaa aantntctca agtgttgatg 12 0 

ctntccaagc ntgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 18 0 

aatatctttt cttttgagag tncccccagt taatttntnc tgtgcttnat tgctnctgtn 240 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc ntagcttttt aggacttggn 300 

gttaaaggag ccagtggtct ctctgggtgg gtnctataat gagttattgt gacccacagc 36 0 

ttgtgtggga ccacatcact tgttaataac acaaccttta aagtaaccca tcttccaggg 420 

gggttccttc atgttgccac tcctttttaa nggacaaact caggcaagga gcatgttttt 48 0 

tngtnattta caaaatctan cagactgtgg gtatccatat ttnaattgtc gggtgacaca 54 0 

tgttcttggt aactaaactc aaatatgtct ttctcatata tgtgctgatg gttttaataa 600 

atgtcaaagt tctcctgtta aaaaaaaaaa aaaaaaaaac tcgagccttt anaactntnt 66 0 

gagtcgtnta cntagatccn gacatgataa gatcatgatg agtttggaca accncactng 72 0 

aagcagtgaa aaa 73 3 



<210> 553 

<211> 870 

<212> DNA 

<213> Homo sapiens 



<400> 553 

nagttaanag taggtcttgt cttttgcaag atcntancga ttcgaattcg gcacgagtat 6 0 

ataacaactt ttgctttcaa agttgggtgg gactagancn cncantggaa ggntggagtc 12 0 

agganacctg gattnttgng cccgntntgg nttttacagt ntgcctaant ttntgcagtn 18 0 

acttcntgcc ancctgtttc nttacntnca anagggaaag acantccttg gccagcctag 24 0 

ttttnagggt gaacgaaagg tcnttntcac tgcntcctct agtcatttgc ttcttcgnta 300 

attaacacat cttgagcacc tgcnatgttc caggaacagg agatggcanc gtgcaagata 36 0 

aagtccctga cttctagaga ctgcatgtta gtggcaatcg gcgtntaccc ggccttnaat 42 0 

aaactactga atgaaggaaa attctaccta caccagacac aattactggg gtttctaaaa 48 0 

tggaattatt cccccggccc cntgcatcca gcagcctgnt gcagggaaac tcctccnaaa 54 0 

ggcttgtaag gcaaggaanc cgggacaatg gcntggctat ttaagcttnc aacaagatgg 60 0 

ttacccctaa gtncctaatt ccctaacacc aagggggccc tttaccagga aaccaaaacc 660 

aggttaaaaa accccaaagt tgggnaaaaa gccatttgcc anccggggcc nttttaaaaa 72 0 

aaacctttna aaaacctttc ccttttaaaa ctttaccttc aagntaaaan tttaagggga 78 0 

atgggnccaa nttttttaac canccccaaa aaaaanttng gnaatttttt ttcccnaaat 84 0 

tttttnaant tccccaaatt tnggaaaang 870 



<210> 554 
<211> 766 
<212> DNA 
<213> Homo sapiens 
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<400> 554 

tatcaatgnt atgtntggtc tnttcgaaag anctagncgg ntcgaattcg gcacgagcca 60 

acacccagtt ctnactctgt catccaggct ggtgtgcagt ggtgcaatgt gggcttactg 120 

cagccttgac ctccaggaca agtgatctcc cacctnagcc tccggaatag ctgggactac 18 0 

agntcaacaa cgcccctctg aaagtaggac tcttggaaat gaaccttgtt gggagtaaag 240 

ctgaaccttc acctctcctt tccaggattc tactccattc atacggcctc acactgaatt 300 

aatgtttnta gcagccacat cacttngtta cccaattgat ctagtagtaa agtcttccca 360 

tctnttcatg taaaaaaaaa aannnaaaan gggnnaggaa ccntnangnt nnnaanaaaa 42 0 

aaaaaaanca ggnggngngc nttttttaac ctataacctg ntttnaggcc tttccccang 4 80 

tttntccnaa ncnnggttan taggggccna aagctaaccg natttttgnt cccntnaggt 54 0 

taggccngaa attaaccngg gtttaaagaa cncattgant aaagccttgc ctnggccaat 600 

tccgggaaaa gggaanagcc tccttgtttt acanattggg aaaaattggc cccaangggg 660 

gttaaccang tttgcccntt aataactnaa anggattttt gncaaaacct ggttccaagg 72 0 

ntttaanccc aanccttttn aaanntnggn cnctttggat gnaann 766 

<210> 555 
<211> 770 
<212> DNA 

<213> Homo sapiens 
<400> 555 

gttatccnat gngcgtntgt ngnnnncnct aanananttg gctngncgct gggccttgct 6 0 

tctctgagaa aactttggtc acacntccaa agccagggtg ggtgcctccn tgnaggaggg 12 0 

ggctttcctg gttggtggcn cagnaggagt ccaggctttg taccgtggac accatgggct 18 0 

atggcaacac cttcctaacc atccttccat gaggacctcg gnaganagtg gacatgaaac 24 0 

cctttgtgct ctgaancatt caacagaagc tttctggttc tgtgcctatt tctttggcac 3 00 

ttgancgtgt ttgcaggttc attacncaca tgatgaaagc tctggcccat agcactagaa 360 

ttcatgtttt nagggtttgt gagtgtgaca ggtgctatgg tttggatgtg gtttgtttcc 420 

accaaaactc ttgcttgaag tttaactgcc agcatggcaa ttgttggnag gtggggccta 480 

ccgggaggtg attgggtcat gggggcttga accctccgga atagattacn gctgcctcct 54 0 

ganaaagttc tacctgtcat gggggctgga tcagtcaaca ttgannantg gggttgttat 6 00 

aaagcaagac tnactcctta tgcaccgttt ntttgcatat gcccctctgg gggnancttc 660 

tttggctgct aacatttttg gacccaaccc aatgggcctt nacccagaaa nccggaacaa 720 

aatgccnnnn gccattcctt tnngganctt tccaacttnc canaaataat 770 

<210> 556 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 556 

gtngtcnatg anatgtcttg cctnncgaag aacnaggcgn ntcggtagaa cagaaaatga 6 0 

gcatccgatt tcttcactaa aggagaccaa actggttcct tgcggcctag tnttnaagan 12 0 

ctggancttg aaagtcctcc ttntaccaac tccacntcca ccccntnatt cccnttntcc 180 

caaagtncta ctgntgttgc ntgacanccc caaatntgtn ctgtcaacac aaacctgcct 24 0 

ttggngtata aacagggcnt tacagaatgg tncaccctat atatttctgt tcagtatcca 3 00 

ttcactagtt cttcattaat aaatatcatc ttccccattc tgctgctgaa tgccacacat 360 

ccatccagtc tgagaaagtg agagaggcaa tcatgccaag aacaagccag caaagctctt 420 

tcaccagatg tagactgtag ccctgctgcc ttccctccag cgagtctgcc agcatgcttc 480 

ttcatccttt taatatgtcc tttgcttcct acttccctgn cttccaacat actgtcactt 540 

actctggcag tcttctgctt ttcattaagc ctcaaaatct cctctgtcta cttggcacca 600 
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caagctatgt cctatatatg natttctgga 
aagtttttat ttaccttcat tatttaaaan 
gagccttttt ggggaaatca atacttctct 



cttggcangg atagttcaag gggtcttggc 
gggccttttg gggatgttgg cctntttaag 
taanaa 



660 
720 
756 



<210> 557 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 557 

tcgtcnaaan nnatgtcctg gctatccgca ggatccaggc ggntcgaatt cggcacgagt 6 0 

gatttttttg gttttttttt ttgntnttgn caaaagctta ntcntttcan ttaaaantgc 12 0 

cactantttg actttttaag taaaaantgt agggggtttt aaanctactt tcctnctncc 180 

aaaaantcag aaagtttcta nctttntaaa ttgggaaagc aagcantgtt ttaaaancac 24 0 

tgaaggaatc tctttnttcg ngnccttttg ttaaactcgg tttaagctgt agaccttntt 300 

taaantaaaa tttaccacag aacaggaaat agaanctgtg gaagactcga aatacacctt 360 

tgtncttctc tgttcttcac ctgctctctc gctgtctcta cacacacaca cacaaacaca 420 

cacacaccta tatttgcatt aaaaatgggt agtaaaagca gtgaagggca aacagaaggt 48 0 

ccattncatc aagtaagagg ttgaatataa actggaccaa gtcttaattt tttatttcct 540 

tcattcggat ncgtttacta atttctttgc tagctttaag acttttaaaa cattctttgg 600 

ccctgggagg gagttgttta cccctaaact tggagaatcc tggccctaga ataaatgttc 660 

cttttaaacc cccanggccg gaaaattgaa tncngctgtg ccaaaaagga aaaaannnaa 720 

aaaaaaactc gnggcctnta na 742 

<210> 558 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<400> 558 

gggtcnntaa tntnnagcnt gtnaaacccc tgagncttnc gggncgttca caaagaaaca 6 0 

tttaataggg acttncaanc aaataattnt cggtttntca ggtggcagca agacaagatg 120 

gtggatcccc atgccattac ctgctagact cagggttnat atactgtagt ggaaaggtga 180 

ttccgaagga atgttgtaag acaattgaag tgcagtanca tcaaagttat ttgacctaag 240 

ggcaggagtt ncagtaagta tccactttta tncaagaaac antagataaa ctggaaatct 300 

tggagccctt cctggaactg gggttaatga gaagtcaaca tggtggatta ncatggaaga 360 

tggagttgct tagtctccca ttcaagatgg agtttcttta gcctccattg atagggagtn 420 

tttaacaaaa ncangaaata agtctttgat ccattgaatc tctaagagtg agcccttgat 4 80 

gactcaggtt taaacagtnc tgagacaatt taggagatag ttttgaagnt caatttgaat 540 

tgtaaaaggt caggattttt taactttttc acatctttga anaaaagccc atagagcgca 6 00 

agttttcagc aaganctgga aancnatatt nctatggaat taaatagctc ctcagggcaa 660 

tcaattnggc ctggganaac ataatgcttc aanggctgan gnaatctgga atttctatgg 720 

gatttcttca 730 

<210> 559 
<211> 743 
<212> DNA 

<213> Homo sapiens 
<400> 559 

gttagtctat aangtnngnt atgtactngc cctttccgnn ggatcccntc gnttcgaatt 60 
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cggcacgaga ggaaacaccc ccttataaaa ccatcatntc aggctgggtg atctgacaga 12 0 

gctagacact gtcaaacaaa caaacaaaca aacaaaaaaa ccccatcaca tctcatgaga 180 

cttatttact atcatgagag cagctcagga aacacccact cccgtgattc agttacatcc 240 

cactgggtct gtcccacaaa ttgtgggagc tacaattcaa gatgaggttt gggtggggac 3 00 

acagccaaac cctatcacca tgtaaaataa tatctaattt gtagagatta aagaacaaga 360 

taacttaaat cttggatgta agttaagaga gtggtggtca gagttaaatc attttaaggt 420 

tcatttattg tctggacaag aataaaattt tgattatcag gaaatacaag taaaaccaca 480 

gggagacatt gnttatatcc aaattgtcaa aaattacaaa gtcttataat accaagtttt 54 0 

gctganggtg tggagcaaca gaaacttttg ttcactggtg ggtatataaa ttgaataatt 600 

tcagcttgga cattacctag caaaattgaa ggctgtatac gtacatacct accaatctag 660 

caattcactt ctagatatta agtcttgaaa aactcacatg tttccagaga cgtgttaaaa 72 0 

ggtggttaaa tcattntgng aat 743 

<210> 560 
<211> 833 
<212> DNA 
<213> Homo sapiens 



<400> 560 

atccngttct ntannnngtc tngttctttc tncacgatcn nntgcgattc gaattcggca 60 

cgaggggtcc tggtgggagt tccatccagc agtgagtgca ttttttcccc agagcagtta 12 0 

agggtcttat taaaagccac cactttgctg aggcctgtac aggccttggg ggtttgggga 180 

agagaantaa ggcaggcact tgtcccttca gggagggact tgtccntact gggaggtttg 240 

gggttgacct tggctccagc agagataccc agcctggcnt ggaagggcag gtcttgagct 3 00 

tacgcttgac tgcaagggca agctgcaggc ctcttctgcc ttcccctgca ttcaccaagg 360 

acaagtagga ccaagaagtc aagggaaaag tgccaagata gatctattcc catttctttc 420 

ttccacctgg agaattcctg agctatgctt caaacctctt ttgggccagg gaaagactgg 480 

gggacatttt ttagtcaagg atgctttaag aaagtaaatt cctgcttggg ggcccaggcc 540 

ttctttttca agggcttgct tgtgaatgcc caaccaaaaa aaaggggccc ccaaggccca 600 

atcccttact tcctnggtcc ccccaaaaag ggatnccaan ttggggaatt gggaaaactt 660 

gggcanncac ccnaanccca ctttggtagg anttnaccaa cccaaccaac ccaaaaccan 720 

cccacccaaa ttnaaaaaaa ggccaaaacc accaaccaac cnaaacccnn annnnnnnnn 780 

nannnnnnnn nnnaaaaaaa ctttgangcc ttttaaaaac tntttngngn ggn 833 

<210> 561 
<211> 773 
<212> DNA 
<213> Homo sapiens 



<400> 561 

tagtctaatg tnnnaaantn ngcnctngtt ctttctgcag gatcccatcg attcgaattc 60 

ggcacgagga agaggaggct gtgtatgagg aacctccaga gcaggagacc ttctacgagc 120 

agcccccact ggtgcagcag caaggtgctg gctctgagca cattgaccac cacattcagg 180 

gccaggggct cagtgggcaa gggctctgtg cccgtgccct gtacgactac caggcagccg 240 

acgacacaga gatctccttt gaccccgaga acctcatcac gggcatcgag gtgatcgacg 300 

aagctggtgg cgtggctatg gccggatggc cattttggca tgttccctgc caactacgtt 360 

ggagctcatt gagtgangct ganggcacat cttgcccttc cctctnaaca tggcttcctt 420 

attgctggaa gaagaagcct gggaattgac attcagcact cttncaggaa taggaccccc 480 

agtgangatg aagcctcagg gcttccttcc ggcttggcag actaacctgt caccccaaat 540 

gcagcaatgg cctggtgatt nccacacatn ctttcttgca ttcccccgac cttccagaca 600 

gctttggctc ttgcccctga caggatactt gagccnagcc cttgcctgtn ggccaaaccc 660 
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tgaattgggc cacttgccaa acttgcnggg gaaagggttc cttgaaacaa gggggccatt 72 0 

tttggggaag gcttcttggc ttggcctttt ggcatttnaa tttggccttt ttt 773 

<210> 562 

<211> 655 

<212> DNA 

<213> Homo sapiens 



<400> 562 

nnatanacat taangnnaga ngntgagnan ttnccntcgc tctntganna naaggcgncg 6 0 

cgaattcggc acgaggccac cggtctcttc ctaatctgca cattntattt tgggtatttc 120 

tgggcgggca gttcctttgc atgtttcggg agaggtttgt tgatttgggg cttatatgtc 18 0 

aggcctttgg tttgcgtctt attttagggg ttgtttgggg gcctgggtgg tcggcctcac 240 

atgggaaggg gatgggtagt ggatggggtt tctgtcgnat cttgnggccg gtgattttgc 3 00 

tnncgnnctg tttcacattc ttccccctcc acaagccaaa tcgttcattt ggntncactg 3 60 

tgtggactgt ctgagcttgc cctgccagaa aaatttgggg ctaggcaccc aggtgcanac 4 20 

tttggaagaa gcantccacc tgtgggtacc gcatctcgtg ngtcccactg gcaggctgaa 480 

cctacttgaa catggaaaca gcatgcccat atggcaaagg ggccnnnacn nnngnnnaaa 54 0 

tnnannannn ncngacannc nncnnaatca ngannntcna cannnatcnn annnnancnn 6 00 

nncaantacn ncnaaaacac accnnccana annnnnaann nnnnnncann nnnac 6 55 



<210> 563 
<211> 738 
<212> DNA 
<213> Homo sapiens 



<400> 563 

tnntaatgct ggaattcctn atncttgggc tactcgttct ttctncagga tcccntgcga 60 

ttcgcagaaa agagtatagt aggggatgac caaggtcaaa gtgggtaaag aagactcatc 120 

atccactgag tttgtagaaa aacggagagc agctcttgaa aggtatcttc aaagaacagt 180 

aaaacatcca actttactac aggatcctga tttaaggcag ttcttggaaa gttcagagct 24 0 

gcctagagca gttaatacac aggctctgag tggagcagga atattgagga tggtgaacaa 3 00 

ggctgccgac gctgtcaaca aaatgacaat caagatgaat gaatcggatg catggtttga 36 0 

agaaaagcag cagcaatttg agaatctgga tcagcaactt aggaaacttc atgtcagtgt 42 0 

tgaagccttg gtctgtcata gaaaagaact ttcagccaac acagctgcct ttgctaaaag 4 80 

tgctgccatg ttaggtaatt ctgaggatca tactgcttta tctagagctt tgtctcaact 540 

tgcagaggtt gaggagaaga tagacccagt tccatcaaga acaagctttt gctgactttt 600 

atatgttttc agaactactt aatgactaca ttcgcttatt gctgcagtga aaagngtgtt 660 

tgccatcgat gaatgctgca gaaatgggaa gatctcaaat tctttgctca aaaacgtgaa 72 0 

cttaaccaaa atgatggt 738 



<210> 564 
<211> 798 
<212> DNA 
<213> Homo sapiens 



<400> 564 

nggggngtct aatgctgcnc nnatcnannc anggnnctcg ctctngctcn acnnanaagg 60 

cgntgngtgt gccaccacac ccagctcatt attattatta ttattattat tattttgaga 120 

cgaagtttca ctcttatccc ccaggctgga gtgcaatggt gcgatactgg ctcactgcaa 180 

cctctgcctc ctgggttcaa gcggttctcc tgccttggca ggcacctgta gtgtcagcta 240 
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ctcgaagctg 
ctcggctcac 
aaaaaaactc 
aagatccatt 
ttgggaaaat 
acaagttaan 
gttttgggaa 
gtncccaann 
aaancgnggn 
cacnnanatt 



aggtgggaga atcgcttgaa cctggggggc ggagattgca atggtgtggt 
tgcactcgag cctggcgaca gagcaagact ctgtctcaaa aaaaaaaaaa 
gagccntnna actattngng aggtcgtatt acgtagatcc agacattgat 
gatgaagttt gggccaaacc ncaacttgaa tgcnnngaaa aaaagcttaa 
ttgggaatgc ctatngcttt atttggaacc ctttntaagc tgcaantaaa 
caccncccaa ttggcntcca ttttaatgtt tncagggttn aggggggaag 
ggttttttna aattcncggg ccnnggggnc ccaatgcttt ggggccccgg 
ttttgggncc cttttaangg gngggnttan attggccccc cttgggggna 
anatacctng gtcccctgtg nanaaatngg nttcccntta caaaatttcc 
tnngnncc 



300 
360 
420 
480 
540 
600 
660 
720 
780 
798 



<210> 565 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 565 

ttntnngttt naatnntcnn ggnntcgntc tnnctcnaan nanaataggt ttggcgaatt 60 

cggcacgagc atgctggcca gcatccctgc ctgtgcaagc tctggatgag ctgtgtgccc 120 

ctgccacnca cacccngcac tccctgccag cctggcctca gggcctctga tccatgtgca 180 

ctggagtggt gatgactgac agggccactg gggcatttnc acgttaacag cagctgccac 24 0 

tggcaaaaga agtgactcgc caatggtggc atctcagatg tgggcccagg agtctgggga 3 00 

gctactttga acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc 360 

atgtgctgga gtagtcaagg gaaataagac actctccttg tccttgttaa ctcaatcaac 420 

aagcatttgc agagcaccgc ctatatgccg gcgctgtccg aagtgctgaa gatacagcaa 480 

tgagctaagt aagcactgac ttcgtagaaa accataacat cggccatctt tggaaaagag 540 

aaaaacaatg gagttactta tttaaaaaaa aaagaaagaa agttatctct tccanganag 600 

gctagaagta cttttctgct ttttggccag tgcccantgg aatgcctggt ttgggggaag 66 0 

aagaagggac tgggttaact gtggtgcttt tgttgtaaaa aggcanctgg cctttgtact 720 

tgaggagaaa natggagcct tggg 744 

<210> 566 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 566 

gnagtnntat tgatttntct ccgtgaatcg ttctrmctnn annanaagtg ngttnngccg 60 

ctggctatgt ggacgctggg gcagagccag gccggagtcg aatgatcagc caggaagagt 120 

ttgccaggca gctacagctc tctgatcctc agacggtggc tggtgccttt ggctacttcc 180 

agcaggatac caagggtttg gtggacttcc gagatgtggc ccttgcacta gcagctctgg 240 

at 99gggcag gagcctggaa gagctaactc gtctggcctt tgaggtaatg gggggtggcg 3 00 

gtggtggggg gtgcttantg gctatgctca ccccgctnca ttangcctat tttggtctgc 360 

tgtttccaaa tgcttctana tctaggcatt tggtatccaa cctattgcca cantgcctan 420 

aactncanac ccccngccnc tatgntnana cctacttggc acaagaacaa nngnanacnt 480 

tgnnnatatn ccanaangnn naanattaca nantnttata ataccaattn ntnttgangg 540 

tgttnnnnnc anaaacnttt gntnacngnn nnnnntatna atnnataatt nnnnntttgn 600 

nancannanc tatgnnnaat taaangnntn tntncnnnnc nnnacnnnna nnnnnnttan 66 0 

nnanttncnn ttnnnntnnn nnnnnnnnnt tnaanaannt nnnnnttnat nnnannnncn 720 

nctnnaangt ntntttnnnn nnatnnnnnn nnnncg 756 
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<210> 567 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 567 

gnntgtnntt nncnnannnn anganagagn tactcgctct ntctctacga tanantgngt 6 0 

tncgaattcg gcacgagatt tcctccagtc ctgggcccca tccttnaggg ccttcccagc 12 0 

cagccagcag gagaggcaag aactggggga acacaggaac ctaggggagg aggggagcgc 18 0 

tgggcatcct caggctggcg gccaagcctg cccctggagg cactagagga gggcatctgt 24 0 

ctgtgggagc ccagagctgc agggaggagg aggagggagg tatctggtgt gagcgttgcc 300 

cctgcgacat ttgggaccac acaggtgggc ttccttattc cctgacaaag cctctgtttc 36 0 

cagctcttcc gccctctctg gatgagggaa cagaagtgga ggaaacaaaa gaagcagcag 42 0 

cacgcacagt cctgtcgctg ggtgcggaga cagcctggca aagtcccact cagccatggc 480 

ctgatgcang ccccagccct nctttcttgg gtgtcaaatg actgtgtcct ggacatctga 54 0 

tgcaccacct gccctgcctg ttgcaaacgt gatgctcccg gatggaatgg agaaactagg 600 

agactgggac aagcaaaang ctgcaaacaa cccagaaccc attcttagaa nactggagaa 66 0 

atgattgagg aatcattggc accgtggncc tgtgcttcat nacaaacacc tttnagaaca 72 0 

acttgggatt gaaaaaccaa gacant 74 6 

<210> 568 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 568 

gnnnntngtn gttcttanng ttnggatctc gttctttctn cacgatcncn tcgattcggt 60 

ctgggcagcc tacgctttcc ggataaaaat ggcagaatga aagaaattat gagtggaact 12 0 

agagaatagg aaagacatga accaacgccc aaaatgagaa agaaggacat ataaagaaaa 18 0 

agacaaatac aagtgaaaaa aatagactaa tggattaacg tccctgtcgt gtgacatttt 24 0 

ctgctatgga aatgatatta gacaaaaagc acttcaagtg gttttcttat ttgagttcaa 300 

aatgggtcat aacgcagcag agataacttg aaacatgaac agcgcatttg gcccaggaac 36 0 

tactaacgaa catacagggc agctgtgatt caagaagttt tgcaaagcag actagagcct 42 0 

tgaatatgag gaacacagtg gccagccatt ggatgcttca cttcttgaag catcttgaca 480 

gctttttgca ggtgaaatgc ttncacacca gcaggatgca gaaaaatgct ttccaagagt 54 0 

ttgttgaatn cagaacatgg atgtttatgc tgcaggaatt aacaaattta tttctcgttg 600 

gcaaaaaagt gttgattgna atgggtccta tttgattaat aaagatgtgt ttgagcctaa 660 

aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 72 0 

nnnnnnnnnn nnnnnnat 738 



<210> 569 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 569 

gtttntgant ntgattctta tgcttngnct aatgctttnt ctnnangatc ccnncgattc 60 

gctggaggag aggagctcag agttctacag agtctttnct gaacaatatc agaaagctgc 12 0 

tgaagaggtg gaagcaaagt tcaagcgata tgagtctcat ccagtctgtg ctgatctgca 180 

ggccaaaatt cttcagtgtt accgtgagaa cacccaccag accctcaaat gctccgctct 24 0 

ggccacccag tatatgcact gtgtcaatca tgccaaacag agcatgcttg agaagggagg 300 
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ataaaaactt tcagaatgag caaaacacca tcaacgttaa ttccagagat ggaacatttt 360 

ttttcctagt gagaaaacaa cccatttgaa gagaagaccc taatgagaag accctaaaga 42 0 

gagacatcaa gaatggattc agcagaatca tttcacgttt tgaacagcag cagtttgaan 480 

ggccaaagcc tttgatcagg gatcccgtca ttaaaggaca ctcttgagta ttagtaaacc 540 

ctcttatgat gattaaaaga gaagggcagc cctnttcacc tttttggtct ttctattcaa 600 

cttgcctgac cataaaatgg ttctcttctg nacaaagccc catcatttgg tgaacctcac 660 

ccttaacaaa gtaggattgg ggttgggggg cttaattaat tggaatgggg ccaaggagaa 720 

gagcccgaaa ccttagatnc cangggnana agt 753 

<210> 570 
<211> 832 
<212> DNA 

<213> Homo sapiens 
<400> 570 

tnatnaataa ggtttgantt cttatgcttn ccaanngctt ggacctannt anccangcgg 60 

tgcgaattcg gcacgagcca ggccccaata atctgggntt naaactttga ggaaatgcca 12 0 

gtgacttatt ccagagtgcc tcagttaggg gaacttctct gtaaagaacc ctgggtattg 180 

agcaaaaacc ttattatcgt taatgaccta taattggaag cttcctgcct ttttctttgg 240 

ttgctcctgt ggaaaatact gaaaagatta ctttgtttta ttttgttgtc tttttataaa 300 

aggggaggtg gagagacccc ttcagagcag ggattgtgcc gggagagtgc ctctgacttt 360 

gggacatttc atccacagaa atttncaagc caatggtttc ttttgggttt tgggttttta 42 0 

tgtttgnttt ttgggggttt ggaaaaacat gcatttttac cgtgcacgta aaattggtca 480 

nagaaaaagg gagcccagaa aangcagcan atgggccatg cccctttgct gggttttcct 54 0 

tttcttttgg gactgtnaag gggaaatggg tttttanaag gtgaaggttt ggtcctgttg 6 00 

gaaggaaaag aantgtctct gttngggggg acaanaaggn acccttgggg gaggtccatt 66 0 

cgcaatggtn cctaccaaaa cnnggntctt taanaacacc ngggcctttg ncccaggnaa 720 

aaaaccctgg gcccctttaa naaactttgg nangggaacc ccggaaaacc cccttgggcc 780 

ttnccaaatc ttttttccca aagncncccc cggggggccc aaaaaaaaac ct 832 

<210> 571 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 571 

agtnttaatn ntggacttct aanganttng gctnntcgnt tggaannnnn cagtnctcta 6 0 

nnagcccatc gatgcgaatt cggcacgagg ctaggattac aggtgtgagc caccatgccc 120 

agccacttat ctttaaagga ttaagtttat gtttcctact atgggaaacc atcccacccc 180 

aaacttgatg accgcattat gtgcttttat agaacatggc acttctccag gatagcattt 24 0 

attctgtttt gtaagtgtga atgtaattac cctacacaca gcatacacat aatcttcata 300 

ttctttgcct tgtcttgtga aggcaagggc catgtctatc ttattcgtca ttagattccc 360 

acatccaaca tagtcctggg gacagcacca atgcactttt ggtgcataag caaatagtgc 420 

atttatagct cttacctaca atatctgata gactaatcaa atatagtagg ttatctgggc 480 

ctttttgatt catgtctcta gcttaacttt catttttttc ttatttggta tctctcactt 540 

tgccttttga tatactctta cagtttcgct cactgagtaa aagaaaatnt aaacagcaag 6 00 

aagtaaactt gtgttttatg gatttngata acatcttcta aaagaccccc caagattgtt 660 

gatgtctaaa aaaattaaag ggccttcaac tcataataat acttaatagt tcttaaaata 72 0 

ttacaaactg attggaacat tgcctaac 748 

<210> 572 
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<211> 755 
<212> DNA 
<213> Homo sapiens 



<400> 572 

agtcttatta nnnngttcta atcctttctt aangagnnta ggctactcgt nctttctgca 60 

ggtatcccnt gcgatncgaa ttcggcacga ggctgagcac ctttggaaac aacatttaag 120 

ggaatgtgag cacaatgcat aatgtcttta aaaagcatgt tgtgatgtac acattttgta 180 

attacctttt ttgttgtttt gtagcaacca tttgtaaaac attccaaata attccacagt 240 

cctgaagcag caatcgaatc cctttctcac ttttggaagg tgacttttca ccttaatgca 300 

tattcccctc tccatagagg agaggaaaag gtgtaggcct gccttaccga gagccaaaca 360 

gagcccaggg agactccgct gtgggaaacc tcattgttct gtacaaagta ctagctaaac 420 

cagaaaggtg attccaggag gagttagcca aacaacanca aaaacaaaaa atgtgctgtt 4 80 

caagttttca gctttaagat atctttggat aatgttattt ctatctttat ttttttcatt 540 

anaagttacc anattaagat ggtaagacct ctgagaccaa aattttgtcc catctctacc 6 00 

ccctnacaac tgcttacaga atggatcatg tcccccttat gttgaggtga ccacttaatt 660 

gctttnctgc ctccttgaaa gaaagaaaag aaagaagact gtgtttttgc cactgattta 720 

accatgtgaa actcatctna ttaccctttt ctngg 755 



<210> 573 
<211> 743 
<212> DNA 
<213> Homo sapiens 



<400> 573 

cangtctaat gctggctctn atcggttctt nnnantnaag ntactcgttc tttctncang 60 

natcnnntgc gntncgctca cacagcatgt gtcagatcca tggggtagga gtcggccaga 12 0 

gacttggtaa cagacagatt gctggatccc acccctagac tctctgattc agttagtttg 180 

gggtaaggcg caagactgaa tttttcacaa gtttcccagt ggtgctgata cttctggtcc 240 

aggaacttag tggggagaga acgactaatc tagaccattt cacttcacat tctgagcttc 300 

ttgtcactgt cacactgcat ccttttaaca atgcattccc tatcctattg caatactgac 360 

atctcatcaa tattttaaaa catgcgtttt cagaaacaat attttatatc aaatactcac 420 

ttttagtaat atttctgcaa ttttgcccta tggatctgag atctaacaaa tactattctg 4 80 

gacatgggct acaacagttg aggctggaag taaaaatgtt aaaccctgct gaccacgtta 540 

ttttaaagtg tattttagtt aagaataata tggcttagga gcagggctaa acagtagcag 6 00 

tcacatgggg aatgatactt tgcttttgca cataaaatgt cctgaaggga aaaaataaag 660 

cagaaaattn ncagatgaac tgaaaatctg tacaaatgtt gggctgaata ctgccagcgt 720 

tgangtgtag gaaaatgaac cnt 743 



<210> 574 
<211> 737 
<212> DNA 
<213> Homo sapiens 



<400> 574 

ccgtctaatg ctggnttcta atcgctttct taangctcnn gggctcgntc tcnctncacg 60 

cagcccggcg gtgcgaattc ggcacgaggg gattacaggc atgacccacc gcgcccagcc 120 

tgtnatttct tatactntgt attttggnct tgtattatgc ttctgatacg ctataattat 180 

ttatgtccat gtncntttct tcaatagact gtgaactctt cgaatgtngg actcctagag 240 

ctagatnctc nattattnnn tattaaattg aatgacttgn aactacagat cctttattta 300 

aacttcccaa atttctgctt tatctaggcn actctttaaa ttcttttatc tcatgtagat 360 
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ttcanaggct gaaataattg agatttttag tttgaagaaa agagaactgn ggatttaatg 420 

gcnttattat tatattttta atggctgttt gggagtnagg ttgcagacat tggtcacttt 480 

cctcctaaat ncttaaatat ttcctaaaaa caggncattc tttnttttnt tatggagtct 540 

ggctctggcn tccaggctgg antgccgngg cccatcttgg cttactgcag ctccccttcc 600 

cgattcncgc tggtctcctg nctngctgct cgggaggctn aggccnggga atcgttgacc 660 

ccggaggcgg aggttncnan agcctrmacg ggccctnggn ctcccggctg ggtacnngac 72 0 

cggacctccg nctgnat 737 

<210> 575 
<211> 766 
<212> DNA 

<213> Homo sapiens 
<400> 575 

gnagttnaaa agcggntttt antcctctcn aatcngnttg ggctactngc tctttctgna 60 

ggnatcccat cgattcgaat tcggcacgag ctttctccct ctgtgcctcc tgcttccttt 120 

ctctctcctg cctctcctct gctccccatc ccactttctc atctgcctcc ttttctcact 180 

tctgtcagtc tgtaagcttt gataacctgc ttaatactcc aaagtgtgag ttcctctgat 240 

ctcttgattc cttagttcta atctcacgtt ttgtttttaa gagatggagt ctctcactct 300 

gtggcccagg ctggagtgca gtggcatgat catagctcat tgcatccttg aaatcctggg 36 0 

ctcaggtgat cctnccgcct gagcctcctg agtatctggg actacagatg cgtgccacca 420 

agcctggcta attttgtctc atgtcttcta aaaattattt tgtgaagccc cttcacaaaa 480 

aaccttaang gaaatctgat ggtgctcagg aatctaactc tccctaaacc atcctctttt 540 

aactgcttct aaaatatctc tggtggcctt tcttagcctt tttctggttc attcaatgct 600 

tcaaagcgct ttttgnttct aagttgagtn ctttgggggt ttgacaggta gtgacgtgta 660 

gttttgacac tgttaacttg ttnaatacag tgaaaangtt tgtgaagtga aaaatgcttg 720 

anaaagaatg gnaatgcctt tntacaaata aaagtnttgt taaaat 766 

<210> 576 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 576 

ggggtnnnna gngnnttgan cccctttctt attatcaagg ngctngcnct nnctnnannn 6 0 

ancacaggcg ntgngaattc ggcacgagaa gataacctct taatgcattc atgttgtata 120 

tgaaggaaat gagagcaaag gtcgtagctg agtgcacgtt gaaagaaagc gcggccatca 180 

accagatcct tgggcngagg tggcatgcac tgtccagtag tatttattgc tttagagatt 24 0 

gcttgtcgta cctgtatgtc gtcccttttt aaatatgttt tcctttttct tgaaactgta 3 00 

taaagttttt ttccccctta gcataagcat cttatatata acaactcatt tgtacaaggt 360 

ttttaagttt atatataaaa tgtgtatata tatttttgnt tccccttttt gacttttttt 420 

ttctgtatga aacccagatg tcaccaaatg gacattaata gttgcattaa ggatcagtag 480 

cattaacaaa agttgcttta aaagccatta tgtaaaacaa gacttgaaaa tgagtgaggg 54 0 

aattttagcg acactgtctg agcacagtgg gaaccatctt cgtttcccct ttgaactcca 600 

antgggatgc cctaccctgg cgccccttag gaccccggac tggcccgngt acaaaacttt 660 

accgtgccaa aattcttaag tgaatttacc tttctncctc tttttgaagc tngaaatttt 72 0 

tggtcatcan gntttgcttg tgatngtaca tanggtngaa n 761 

<210> 577 
<211> 803 
<212> DNA 
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<213> Homo sapiens 
<400> 577 

gggtngttnn nnngtggnnt tnttnnnngt ttctaatnnt cgnggngntc ganctnnctc 60 

nananagaat aggtttgnga attcggcacg aggtctcccg cccggcgccc ccagtgtttt 120 

ctgagggcgg aaatggccaa ttcgggcctg cagttgctgg gcttctccat ggccctgctg 180 

ggctgggtgg ggtctggtgg cctgcaccgn catcccgcag tggcagatga gctcctatgc 24 0 

gggtgacaac atcatcacgg tccagccatg tacaangggc tgtggatgga ctgcgtcacg 3 00 

cagcctctag aactatagtg agtcgtatta cgtagatcca gacatgataa gatcattgat 360 

gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt 420 

gatgctattg ctttatttgt aaccattata agctgcaata aacaaagtta acaacaacaa 480 

ttgcattcat tttatgttca agttcagggg gaggtgttgg aggtttttta aatnnncggc 540 

cncngcgcca atgcattggg ccccgtaccc acttttggtn cctttaantg aagggtttaa 6 00 

tttgccccnc tntgccgtaa ttcatgggnc atanncttgn tttcctggng ttgaaaattg 660 

gntaatcccc ttcnacaaat ttccnccaca atcatttacc aaaccccngg gaggcctttn 720 

aaagnngtna aaanccctgg gggtggccct taatttaagt ggnnccttaa ctcncnttta 7 80 

antgccnttg cccttcactg cct 803 

<210> 578 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 578 

tcgtcccntn gatcggggta acgtccttnc ctatnaaant tctttcggga aagcagaaac 60 

caagctggca gaagcacaga tagaagagct ntcgtcagaa aacacaggag gaaggggagg 120 

agcgggctga gtcggagcag gaggcctacc tgcgtgagga ttgagggcct gagcacactg 180 

ccctgtctcc ccactcagtg gggaaagcag gggcagatgc caccctgccc agggttggca 240 

tgactgtctg tgcaccgaga agaggcggca gatcctgccc tggccaatca ggcgagacgc 3 00 

ctttgtgagc tgtgagtgcc tcctgtggtc tcaggcttgc gctggacctg gttcttagcc 360 

cttgggcact gcaccctgtt taacatttca ccccactctg tacagctgct cttacccatt 420 

ttttttacct cacacccaaa gcattttgcc tacctgggtc agagagagga gtcctttttg 4 80 

tcatgccctt aagttcagca actgtttaac ctgttttcag tcttatttac gtcgtcaaaa 540 

atgatttagt acttgttccc tctgttggga tgccagttgt ggcaagggga ggggaacctg 6 00 

tccagtttgt accatttctt tgnatgtatt tctgatgtgn tctcttgatc tgcccccact 660 

gtcctgtgaa ggacagctna ngncaaggag tgaaaaactt tacttcttaa aaaaaaaaan 720 

nnnnnnnnnn nnnnnnaa 738 

<210> 579 
<211> 758 
<212> DNA 

<213> Homo sapiens 
<400> 579 

gnngtgncta nctaaatnnt tggntntaaa cgtncnttct gcatnatccc tnnttgacga 60 

attnggcacg agacagagtc ctgaaatatg caaatgaagt aaattctgat gctggcgcct 120 

tcaagaaaca gcctaaagga cctgcctgat gtgcaagagc tcatcactca agtgcggtca 180 

gagaagtgct ccctgcaggc cgaagccatc cttgatgcaa acgacgctca tcaaacagag 240 

accttctcct cccaagtcaa agggacaaat aagcctctgg gttgaacggt ttgagacatt 300 

ctgccttgga ccttcccttg tcaccaaaca agccaacctt gtgcacttcc accaggcttt 360 

cacccattcc ctgcaagcct tggtcttttg acctggccct caaccatgtg gctttccacc 420 
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ccttgaggac aagttggaac agaagaccaa gagtggcctc actggataca tcaanggcat 48 0 

ctttggattc aggagctaac caggctcttn ctcgggggcg ggggagattc tgactcttaa 54 0 

tctggattgt gagaaaaatc cagcaagttc catgatattt aaatccaggt ctgcattggc 600 

ccggggcaag agtttaacat cttcgggccc tgcatttcct acatcttggg gtctgtacac 660 

gttcttaagc aagcgtgtca ngagagcacc ctgttggctt cttggtaaaa tgtgtgcaag 720 

gtcatnctgt cttctgnacc ttctggggaa aagggncc 758 

<210> 580 
<211> 816 
<212> DNA 
<213> Homo sapiens 

<400> 580 

tttctaaatn gcttgggttt cnaaatccct tggttgacgc cctcgcctaa nntggcgtgn 60 

nantgcccnc gattcgctgn caagtctgga antcatattg gagcctgngt ngactgaaaa 12 0 

ctcagcanga gttgatgtta aagtcttggg tctgaaattn gtngggcagg agattaggct 180 
ggaaactcag gcagaatttc tgtgttacaa tcttgaggca taattcttct ccaaaaaaat 
ctccattttt ttctcttaaa gccttggatg agccttggat gattggatga ggactaccca 

cattatctag ggtaatctcc tttgcttaaa gtaaactcac tgtgttaatc acatcaacaa _ 

aataccttca cagctacatg tagtgtttga ccaaacaact aggcaccata gcctagccac 420 

ataaaattac tatcattata ctttgtctta tcacatactt ctaccttgga agggatattt 480 

cccagttggt atagctacaa aacagaggca gatcatttag cctgcattng attngtantg 54 0 

aaaaataagc ctttggtgng tttaaccact gaaaatgttt gcggcctatt agtantngca 600 

caacttatcc tatnctggcc aaacatagaa tgctttcggt ttgcaaggta acangatccc 660 

ctttacagnt gtacnaaaaa tnancnntaa aaaaactnga gccctntaga acntnntagt 72 0 

ggagtcggan ttaacgttng ancccagacc ntggattang gatncattgg atggagtttg 780 

gacataccac cancttggaa tggcnantga aaaaaa " " 816 



240 
300 
360 



<210> 581 
<211> 868 
<212> DNA 
<213> Homo sapiens 

<400> 581 

ccnnganncn nnccnnnnnc nnacaaaanc nnnnnnnann nnnnnnancn nnnnnnctct 6 0 

tcnaannctg ctnacgccca nagcatgacc cacgatcgaa tcggcacgag gttgcaagca 12 0 

gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtn ccagaatgat 18 0 

acacggatca gngcanaagn tcatcaggct ctcggacctt agggctgntg gagaagcttc 24 0 

agcagcagaa ctgatggtga aggctcgtgt tctccatcct caactttctt tgcttcgatc 300 

atacacaaga atacattngg aagggcaaaa aaatgaacac tgtcgnncat tgcagcccgn 360 

gtttngtgac acagatgcac agtctgcttg tgaagacctt ctctcaagtg gcatttggga 42 0 

gtccatgcca gancatggtg cttcatgaga gactgacagc tatcaggggt tgnggcactt 480 

agngaggact ctcctccccc agtgtgtgct gatgacacat cacacctgac aatagctnga 54 0 

agnctnctct gacccntntt actctgtagc caacatacca catganttta aaacccnttc 600 

taaaatatcc aancaatggg gtcatacntg gcccaaatgc cagaantcna gagcctaata 66 0 

ggacttccaa tnattaactt tnccaaannc gaaaaaagna gggcnttccn nttatggcaa 72 0 

aaaaatnaan nnaaaaggan atntggnatn gttngccnaa aaaaaaagcc cnntnngaaa 780 

cctaatanga ggaggtccca cttaaccggn cgnancccca gaacantgga atacaggant 840 

accnatngga ntgaanattt ggggancc " 868 

<210> 582 
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<211> 745 
<212> DNA 
<213> Homo sapiens 



<400> 582 

ttctgaatac cttnttacnc gccttcttca ggantttcaa gacctaattc ggcacgagac 6 0 

cctttctgcc ttctgtttgg gacccagctg gtgttctttg gtttgctttc ttcaggctct 120 

agggctgtgc tatccaatac agtaaccaca tgcggctgtt taaagttaag ccaattaaaa 180 

tcacataaga ttaaaaattc cttcctcagt tgcactaacc acgtttctag aggcgtcact 240 

gtatgtagtt catggctact gtactgacag cgagagcatg tccatctgtt ggacagcact 3 00 

attctagaga actaaactgg cttaacgagt cacagcctca gctgtgctgg gacgaccctt 3 60 

gtctccctgg gtaggagggg ggggaatggg ggaagggctg atgagacccc agctggggcc 420 

tgttgtctgg gacccttcct ctnctganaa gggaggcctg gtggcttaac ctgggcangt 4 80 

cnngtcttct ctgaccccan tggctgcngt gaaggggaac cacccttcct tgcttgacca 54 0 

ntggccatta nctnccntna ccacttgnaa cccanggtcc canctggctg ggaccctntt 6 00 

ntncccccaa ngncctttcc cttgggctnt nttggantga gcaccttctn tgtnngcacc 660 

ttttanaant gnnnnnntgn tactgatttt tttgntaaaa agannttaaa anctggnant 720 

ttntnaaaaa aaannannaa aannn 74 5 



<210> 583 
<211> 748 
<212> DNA 

<213> Homo sapiens 



<400> 583 

gnttctaatn cttggcctac tcgcctntct ncaggatctt atcgatncna attcggcacg 60 

agatatggta tagttggaaa taggttattg tgagttattt gtagtcatgt ctttaatggc 12 0 

ccttgcatgg tgtctaactt ctgcaataaa tgatctgcca gtcctagtgt ctgggcttta 180 

tgcaatttgt tttcctttgt ggatgaagtg ggagtaagac ttgttgctgt gaggatcaga 240 

tgaagtggct aggatatgga cacactttac ttgaattgga aaacaagcca tgtatcccta 3 00 

atctgcaaaa tgtggcatgt cacacgtgta atctctgagg tttagttttt gctcaagatt 36 0 

gcaaaggtga cttgcttgat gctttctttg cttgagcaca catctcattc attaaatggt 420 

gtctcctttt ttgcacacag gatgcagaac ataattgacc ttttccaagt ctacttagca 480 

gaaatgaaaa tggaatcata taaatacagt attatacttt aaaataaaaa ggctgtacaa 54 0 

aagtttggct gacatagctt gcttctagta atctgaatgg cttatttaaa taaagttgga 6 00 

tctatggact cttcacagnc tagatattat cctactggaa gatgtgcctc gaaagctgtt 660 

gaaccacngc aaaaaaaccc ttcagtcagc acgtgagaaa acctgcgagc ccacatttcc 72 0 

cccgggacca ttctgaacat cctactgg 748 



<210> 584 
<211> 773 
<212> DNA 

<213> Homo sapiens 



<400> 584 

tttaatgctt gttacacgcc ttctgcagga tttatcgatt cnaattcggc acgaggctat 60 

gtattgtgtc ctaccatgaa ttcactccat gctagccaca ttggcctgta tggctattcc 12 0 

ttggacacac ctaggatgtt cttgcctctt agcttgccta cctttctctc atcatttggg 180 

cctcancgag gatatcatct cctcagagaa gccttctgtg accatgctat ctaaaatact 240 

ccagcacttc agtcaccctt tatcccatta ctctgctttt tcagaaacat tggtgctccc 300 

tgaaacatat ttgtttactt gcttagtgtc ttttctcccg cactaccatg taagcttctt 360 
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gagggttaag ggaccttgtt agggataacc actgtatcct tagagtgtga cacatagtag 420 

gttctcaata catatttttg aaactctacc ctgatgcaaa agagatatca aataattata 480 

gtttttgcat tataaatggc tttggtgaaa tccctggcac aaaactaata ataaaagaaa 54 0 

taaacagata atgttgaagt tctgggcctg caaaacctaa ctcttttaaa gcagtcccag 6 00 

taaatgtgtc attgggatcc ataagacttt gtgggaaagt caacataatt ttattnggga 660 

aaaagcattg aaccttcaaa agtnaaaact ttatnggncc aaaatctcaa ttactggggg 720 

gccgttcttt aagtcatttt aaaccctttg angccnacag ttttacacca aat 773 

<210> 585 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<400> 585 

ttcaatacnt ntttcnngcc ttttgcagga tcnctcgatt cgatggaaca tgagtggaag 6 0 

tgggcagtct ttttctttcc ctatcagctg agtgaatgaa gatttagagg gcagcagagt 12 0 

catgacatgg atgacgttgg gtctctggat ggctaaatgg aagacccgcc ccccaacgcc 180 

actctacccc cctgctttga actatgcttt gagaaatgag cttatgagac cactgagact 240 

tgggggctgt ttgttcagca gttcacctac acttattagg aaaggttgac ttcttgtact 3 00 

acgcctttcc ttaaatcatc ttttgtataa ttctcagaac actgctggtt tgggtggtct 360 

cacacatttc tcacatccaa attttaaaga tttcatgaat gttcattaca gtggatttat 420 

ttttctcttt ctgcttctcg gcatgccctc tcaatttggg agaaatctct aattggatga 480 

ctttggtggg accacaggag tgtaaggatc gtaattccct cacttcatcc cctgcaaatt 540 

aaagcctggg cacttaagac tcactcaact gaatcttgat atgtgggact ttanatctta 600 

agcaaatgan gcaaaagaag gaaaagacag ttgagaaaat caatctctga agttcagcac 660 

ttgatttcca ccgtggaccg gactcctgca nctttgcatt ngccttggtt cctggccatt 72 0 

ttccnaaccc gggttccttt ttgan 745 

<210> 586 
<211> 749 
<212> DNA 
<213> Homo sapiens 



<400> 586 

tgttctaata ctaggtntac tcgccttttg caggatctna tcgattcnaa ttcggcacga 6 0 

ggggtcctgg tgggagtnnc atncagcagn ganngcattc tttccncaca ncagtnaacg 12 0 

gtcttattaa nagccaccac tttnctgang cctgtacagg ccttgnnggt tnggngaaca 180 

gaaatnncgc aggcacttgt accttcaagn anggacttgt gcctnactgn nagggttggc 24 0 

gttgaccttg gctcnacnga catacccant ctgacttnna acgngcncgt ctnagcttac 300 

gctagactgc acnnccaagn ttgcangcct nttntgnctt ccctgcattn accaatgaca 36 0 

gtacgaccaa cagtcaanga aaagtgccaa gatatatcta tcccatttct tctacacctg 420 

tanattcctn actatgctca aactatgtgg ngcaangaan actggnngac atttttagtc 48 0 

aatgatgctg acaattaatt actggtgngg ccaggcatat nttcacggct gcttgtgatg 54 0 

ccaacnaaga acgggcccca gcccatcctt actcctngnc cccaaanaga tccagtgnga 600 

atgggaagct gnnannacca acccaactnn tgatttacca ccaacnccaa anatcacgca 66 0 

tgnnnacagc aaaacaacaa cncnatgcac ttaacaagna nccnaaaant naactcgngc 72 0 

ctctaaaact attngggant cctttanct 749 



<210> 587 
<211> 783 
<212> DNA 
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<213> Homo sapiens 
<400> 587 

gttctaatnc ttggcctact cgcctntctg caggatcttn tcgaccttat tcggcacgag 6 0 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 120 

ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 180 

atgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 24 0 

ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 3 00 

cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 

attctctcat tttttacagt ttatcagttg cactcatagt tattgtcata atcactcccc 

acagtaacct gtaaggcata taaagtagct attttagtaa gataaatgat attttatata 



360 
420 
480 



600 
660 



tgttatgata agataaatct tatcatttta agaagaaact gagctcggag agatgaaatg 54 0 
acttcctcag ttgctgctgt aataaaagtc tactttttgc taaaaaaaaa aaannnaaat 
atnntntann attnnantaa naanaaaaac ttcgagccnt tttnaaactt tnantggagt 

cnntttntcc cgtaaaatcc nnnactttgg atnaanannc catttngatn aagtttttgg 72 0 

gacaaacccc ccaacttaga aattgcnntn ggaaaaaaaa ntgcntttta ttttgnggaa 780 

aan 783 

<210> 588 
<211> 771 
<212> DNA 

<213> Homo sapiens 
<400> 588 

tcttctaatg ctggttacan gccttctgcn gatccctcga ttcgaattcg gcacgagata 60 

ctttttaaac cttttttggc agctcagatg gtgtaaattt taaaattttg tataggtatt 120 

tcataacaaa aatatgtatt tcttttttgt tattttatct tgaaaacggt acatatttta 180 

gtatttgtgc agaaaaacaa gtcctaaagt atttgttttt atttgtacca tccacttgtg 240 

ccttactgta tcctgtgtca tgtccaatca gttgtaaaca atggcatctt tgaacagtgt 300 

gatgagaata ggaatgtggt gttttaaagc agtgttgcat tttaatcagt aatctacctg 360 

gtggatttgt ttttaaccaa aaagatgaat tatcaatgat ttgtaattat atcggttgat 42 0 

tntttttgaa aagatgaacc aaaggatttg actgctaata ttttattcct tacacttttt 480 

tctgaataag tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatngt 54 0 

gccatttgta aaatnanaat aagagcagaa aaaacacaaa nangagaaca ctggnttcag 600 
acattcantg gggcaagtta aattatggga ctgcaaaaat aatggatttt ttattcaaag 
aaaagcttta aaaagtttta ttatccanat ttacaaccca ctanttaagc taaataancc 

tactttnaaa aatngnaaat ggttnctatc tttataangt gccaanttna n 771 

<210> 589 
<211> 844 
<212> DNA 
<213> Homo sapiens 

<400> 589 

tncactnnaa tccttntnta aaaagccttc tgcntgatcc catcgattcg aattcggcac 60 

gaggccagag cctagaggag agatcaaaga cnttngccga agtgaagccc attctgcaag 120 

caactgggtt cccatggcat gtggtggcct tagaggaggt gttcagcctg ccaccgtcgg 18 0 

tgctttggtg ctctgcccag gagctggtgg gatccgaggg ggcctacaag gcggccgtgg 24 0 

acagcttcct ccagcagcag catgtgctgg gggccggggg tggtcctggc ccgactcaag 300 

gggaggaaca gccaccccag cccccgctgg acccccagaa cctggcaaga ccgcctgccc 360 

ctgcccagac tgaggctctt tcccaactgt tctgctcaat gaggacactg actgccaagg 420 
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aggagcttct gcagaccctg cggacccacc tgatcctnca 
actccaaggt catgactggg gacagntgca cacgcttggc 
tgcnctgggt ccaaagggcc ttcctggcct gggatacngg 
ccgggngaac gtnggtggtn ggtgccggnn cattgcctgg 
angaangntc gnttatttct aacaaaaccc ggnccttgnt 
tggnnnnttt tnaanacncn annncccaat tngnaanaac 
aaaaaaangg ggccnatatn ntntntcana cccgnggnct 
tnct 



cgtggcccga gcccacggct 
tatcaagctc atgaccaacc 
ctttcttgga tgaaccngna 
gaaccaccac ccccttnaaa 
tcntaccntn ttccctntct 
ccnaaaangg gnccctttgn 
ttgaatnngg aaaangccnc 



844 



840 



480 
540 
600 
660 
720 
780 



<210> 590 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 590 

tctaatgctt ggntctngcc ttttgcggat ctttcgattc gnattcggca cgagagaacg 6 0 

ttctcaggtt gaccagctgc tgaatatttc tttaagggag gaagaactta gtanntcatt 12 0 

gcagtgcatg gataacaatc ttctgcaagc ccgtgcagcc cttcagacag cttatgtgga 180 

agttcagagg ctacttatgc tcaagcagca gataactatg gagatgagtg cactgaggac 240 

ccatagaata cagattctac agggattaca agaaacatat gaaccttctg agcacccagg 3 00 

tttggcatag aaatggtacc ccttgttcaa aatgaacaag aagccttaga tttggatggg 36 0 

gaacctgatc tgtccagtct agaaggattc cagtgggaag gtgtttccat ttcctcgtcc 42 0 

cctggcttgg caagaaagcg aagcctttct gagagcagcg tgatcatgga cagagctcct 480 

tctgtgtata gcttcttcag tgaggaangt acaggcaaag aaaatgagcc ccagcagatg 54 0 

gtttcaccta gtaactcatt ganggctgga cagaaccaga aagcaaccat gcaccctcaa 6 00 

acaaggaagt nacacctcng gctggccttc ccttccgaac aggtgaaagg ggcttgaaaa 660 

atgttgctac cccaaaggcg acattnntgg caccaaatta tccctcttga ccnntttaat 720 

accttttgat tncatttngg caaaagactt tgnaccagcc nnggaga 767 

<210> 591 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 591 

tctttgaatc cttttgtaaa agccttttgc atgatccctc gattcgaatt cggcacgaga 60 

cttcttgttt gcctttttta taaggaaatg ttggagagtt acatcattgc taatgtagaa 12 0 

atgttaagtg gaaaaatata cagtttggta aaataaacta gattctacat ttatttgtgg 180 

gtttttttcc cctcctttct ttccacagca cttttgatat caagcaagtg gcttcctttt 240 

tgagatatta aaaaaaaaaa gaaaaggaaa aaagtaaatg aagcccaact acctaaccct 3 00 

ttcttatttg tatttgtttt agtattgtga agttgtgtta aatagtacta gctagaaata 360 

caaatttctg gttatcattt ctcttccctg tggcacttga cattttaatt gtcttaaagt 420 

ttttgaagtc atcttctggc cccttgagta ctgccagagg caaaagatgt ttgtttctta 4 80 

ttcattccac ttttgtctcc tgggatccct tctgtagcct aaagtatggc tgggaaatgg 54 0 

acttgagaag attggcttga attangatca taatcatgtg tgatcccatc atgaattcat 6 00 

tggaatntgg ggtncatgta angcaatcnt tctggtgtaa atcttccttt ttttaatgna 660 

catatanttt tggaaaaaat tttgaattaa ccctgaaaat ttttaaaaaa gccctcttan 720 

aactattann ggaggtcnca ttaccctaga atccanacat tnant 765 



<210> 592 
<211> 757 
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<212> DNA 

<213> Homo sapiens 
<400> 592 

tnttcnaana ctngttctng ncttttgcag gatcccatcg attcgccaaa tctgcctaga 6 0 

gattgagttc acagtgtatg ttctgggggc gctggtgcag tcagcggtcc agtctccagc 12 0 

ctgcaggcgt gcacactggg gtggacgatg ggtggccccg caggtgtaca catttgggtg 18 0 

gccccggccc ctatacccca gtgttctctt tgatccagtc ccgaaacaga gggagccttg 24 0 

tgtacacgcc tncaaagtgg agctgggagg tagaagggga ggacactggt ggttctactg 300 

acccaactgg gggcaaaggt ttgaagacac agcctccccc gccagcccca agctgggccg 360 

aggcgcgttt gtgcatatct gcctcccctg tctctaagga gcagcgggaa cggagcttcg 420 

gggcctcctc agtgaaggtg gtggggctgc cggatctggg ctgtggggcc cttgggccac 48 0 

gctcttgagg aacccaggct cggaggaccc tggaaaacag acgggtctga gactgaaatt 54 0 
gttttaccag ctcccaaggt ggacttcant gtgtgtattt gtgtaaatga gtaaaacatt 
ttatttcttt ttaaaaaaaa aaaaaaaaaa actcgancct ntanaactat tagtgagtcc 

tatttacctt agatncagac atgataagaa tncattgatg aattttggac aaaccacaac 72 0 

ttggaatgca ntgaaaaaaa atgctttatt tgtgnat 757 

<210> 593 
<211> 766 
<212> DNA 

<213> Homo sapiens 
<400> 593 

tcttgaatnc tngttnntgc ctttttcgga tccctcgatt cgaattcggc acgagagaac 6 0 

attggtgtgt gagtgttttt tgatggtgca ggacccggag gtgctttcct tgccaagaat 12 0 

agaaacatcc agaatgctcc tccccatccc ccaatcccag acagcaatta tgtcagccct 180 

gtaaggcatt gcctgctctt gaccctttgg cccatctttt tatttttaaa aaattcccat 240 

gtcacagatg ccctgtctat gcagagggtg gcgtgggatg ggtgaccact aagtttaggc 30 0 

tggtgaaggt ggtgagccct tctgaggccc tgatagaact ttccaggagt tcatggtccg 36 0 

cggctccagc ttctcactgt aaagttgtca tcctggcaga ggcagccaat gcttttcatt 42 0 

ctagggggta gagatttatg ctaatgagtg aatattgcac cactagtgac tttctgttta 48 0 

aagttcagct cttagaaaat ggaatcttac ctgaccccta gtgaattatg tacataagca 54 0 

gggaatgttt ccaactagat ctccttcaga agagtccctg tgctggaata ggtcactgaa 600 

tcttatttgg ntttgtnaaa caaaagcttt tgggtctcgt ggggtgtgtg tgtgntttgg 66 0 

ngtgtgttgc cccntntgcc gtttcaaata aaaggtttgg taccaccttt tcaaaaaaaa 720 

aaaatantnt anntnanant nntntancnt ttnttnncnt tanant 76 6 



<210> 594 
<211> 754 
<212> DNA 

<213> Homo sapiens 
<400> 594 

ttgnttagga tcccatcgat tcgaattcgg cacgagggaa ggcagtggga ggagaggacc 6 0 

aagtctcaaa ctccagaagc cccacctccc tgagctcagc tcctctgcca agccccctca 12 0 

gcgcgaagtc ctcgtccaga gaaggcaacg gcgagaaaca aatccaacat cctgggctgc 180 
tttttccttc ccccactttt taaaagtttg gtgtccaagt cacttgacaa acccagaccc 24 0 

taacaatgat attttgtgta gaattctggg atcaaaatat aatttcaaaa ataatatatt 300 
ttctgacatc ccccaaaaaa aaaaanaaaa aaaactcgag cctctagaac tatagtgagt 36 0 

cgtattacgt agatccagac atgataagat acattgatga gtttggacaa accacaacta 420 
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gaatgcantg aaaaaaatgc tttatttgtg aaattttgtg atgctattgc tttatttgna 480 

accattataa agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 540 

aggttcangg ggaggtntgg gangtttttt taattcngcn ggcgcngcnc caatgcattg 600 

gggccccggt nncccanctt ttggntccct ttaagnngan gggtaaantg ncgcncttgg 66 0 

cntaatcttt gnncatnggt tggnttnctg nggngnaaat tggttttccn ggnnanaatt 720 

tccccncatn ttangatccc nggnngntnt aang 754 



<210> 595 
<211> 767 
<212> DNA 
<213> Homo sapiens 



<400> 595 

ggtttaatgc tgttnnaanc cttcttnanc ctttgtacag catccctcga ttcgaattcg 60 

gcacgaggaa cgcttccatt ttatacctgt gtctagttag tttctgccta tctatccaag 120 

aagcttttat caagggtcca ccatgtgcca gccactgaag tagatataaa tacaaggatg 18 0 

tgtaaggtat ggatgatggt atacgaactg tcatcttact ggatttgtcc gctctgttaa 24 0 

agatacggtt ccgaaaactt tttaaagccc tagagagggc tttaaggcaa tgtagcatca 3 00 

tatatagagg catnaacctg ttcatatctt tctatttaac agaactgtgc acctgggcac 360 

aagggtgtgc acaacaggat gtgtacagca gcactgttaa agtgtancac atccatacta 42 0 

cangatctta tgcaactgtt ggaaagaatg aagcgatgct gcactgtggt catgcagtga 4 80 

tctctaagac atattaactc gaaagcaaaa ggtttaacaa tgtatnacaa actgggctgc 540 

aattgactcg cgcctgtaat cccagcnctt tgggaggctt gantaaggcg gatcacctga 6 00 

ngtcangagt ttgagaccaa acctggccaa tgttggccna aaccnctgct tctactnaaa 660 

ctacnaaaaa ttaacctngg gcntggttgg ctccgtgcct tntaatcccn gcttactcgg 72 0 

caatgcttga gngaangnan aattngcttt gaacctnggg gaggnng 767 



<210> 596 
<211> 743 
<212> DNA 
<213> Homo sapiens 



<400> 596 

tnttnaatnc tnttttaatn cttgctgcan gatctttcga tgatcccatc gattcnctgg 60 

tctcgaacac ctgacctcag gtgatccatt cgncttggcc tctcgaagtg ttgggattcc 120 

aggcgtgagc cactgcggcc agcacatttc cacttntaga tcctactcca taccacaggt 180 

ttcatttaag angaaaganc tanataaatg tgctcttntg gataccccac cctgacagan 240 

tgcattgtta cacagntanc atgggttgac actgcaanct ggcctgtcag ccatnggagg 3 00 

ngtttannga aaggcanatn atgtnactct gtgncagggn gccatntgct tacccntnac 360 

ctagcatang gggnttctac gggtgacccc nagcatattt ctaggttact tatgggcaga 42 0 

tttgtaagtg acaaaactcc agctgatgct gggaatgggg agagggccct tganggactt 48 0 

tgtggntttg tgcttctggt ttcctggcca accccagggt cacttgtctg gagcccagct 540 

gggcactaat gtctgccanc gactatntta cagtgtataa atgattcctc tatttgggga 600 

gagatcttcc aatccagaag agcccctntt ggactgcctg ggttaaatct gcatagcana 660 

a gtggttgat gagtcatctg aagaaattca gccccaactt nncaacctgc ccttcctgnt 720 

tcctttttaa tggnggcctn tgg 743 



<210> 597 
<211> 786 
<212> DNA 
<213> Homo sapiens 
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<400> 597 

ngtttnnncc ngtttttaat ncnttgctac tngctctttt tgcaggatcc catcgattcg 6 0 

aattcggcac gaggacanac cgttgagagg acgtggaggc ccnttagggg gtntgcncng 12 0 

nanaggcaga ngtggccctg ggaacagagt tttatgacnc ttttnaccat anangaangn 18 0 

gagaatttna aagatatggt gggaatgaca aaatagcagn cataactgaa gacaacatgg 24 0 

gtggatgtgg agtttggnac ctngggatcg ngnaaagata ccagtgatgt ggagccaact 3 00 

gctccgatgg aggaacccac agtggtggag gagttccant gcanccngga agaggagtat 36 0 

ccagcctaag ttnctgactg gatgtcaaga agaaacccaa nttataanag atgactntan 42 0 

ntgantggnn aaatctttca gatcanncca gaccatancn tgagtttaac atccgnaanc 480 

cacaatccan tgnnccttac taagccgtgg tgattnacaa gtcataaatc cattanatga 540 

tgtggtnaaa gatgcctatn atgaccnatt ctccatngtt ntccngaaac ccgtcaattg 6 00 

acatcacatn tcctnttgga gattaaattt tnggtnancn tnccttcgtc cttgggcatt 660 

ngaacncata agaatgcacc cccnggntag gcccngtnna aaggttnatg aaggccntta 72 0 

taanttttgn nnccccaanc attaaantgg ctngattccc ttaatntttt cctcccnaac 78 0 



ccagnt 



786 



<210> 598 
<211> 809 
<212> DNA 
<213> Homo sapiens 

<400> 598 

ngttttnnnn cnnnttttct aatgcttgct tctcgttcct ttgcaggatc ccatcgattc 60 

gaattcggca cgaggacaga ccgttgagag gacgtggagg cccgagaggg ggtatncncg 12 0 

gcagaggcag aggtggccct gggaacagag tttttgacgc ttttgaccag agaggaaagc 180 

gagaatttga aagatatggt gggaatgaca aaatagcagt cagaactgaa gacaacatgg 240 

gtggatgtgg agttcgaacc tgnggatcgg gtaaagatac cagtgatgtg gagccaactg 3 00 

caccgatgga ggaacccaca gtggtggagg agtcccaggg caccccggaa gaggagtctc 360 

cagccaaagt tcctgagttg gaggtagaag aagaaaccca agttcaagag atgactttag 420 

atgagtggaa aaatcttcaa gaacagacca gaccaaagcc tgagtttaac atccggaaac 48 0 

cagaatccac tgttcttcca aagccgtggt gattcacaag tcaaaataca tagatgatat 54 0 

ggtaaaaaga tgactatgag gaccattccc atgttttccg gaaaaccccc cattgacatc 600 

acattccaac ttggagatta aattttgggt aacccttcct ttgtncttgg gccttngaac 660 

ccntgaagga aggcaccccn ggtgaagggc ccngggggaa agggattcan ggnaaggggc 72 0 

cantaanaaa ccttttggga ccccccttaa nccaataaaa tttggtngaa ttgcnangga 78 0 

atggtttgnc cccccnaaac cccnaaant 80 9 

<210> 599 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 599 

tttntaatnc tttttcnaat gctngcttca ggannntntg cangatccct cgattcgaat 6 0 

tcggcacgag ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa 12 0 

gaaacttcaa gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg 18 0 

tcttctaaat ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag 240 

agtacggtgc cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat 3 00 

atatcacctt ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg 360 

accaagtgtt cagaatttgc ttgactctaa cctggagagc ttcttaagtg atgccccttc 420 

atggagcttc tatgacagtg aataaactat taattgaagg aaaatgttat aattaatgta 480 
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tctatttgct gcattgtata tggattaaat gataaaaaac aagtaatcta ccctcagagc 540 

catgtatttg agaatgcttc aatcatattt tcctatgtac ctttttttta taaacttagt 600 

tttagactat gttgtaaaaa tggggaaagg ttgtaaacta tgtngtaaaa aatngggaaa 6 60 

tgtggcttta aaatatatnc attatatttg gttcaaggat tttggcaggg gntaaaggaa 720 
ncnatggttc aatctttgna tttatatacc ntgatttaa 759 

<210> 600 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 600 

ttttaatacn tttttnaatn cttgcttncg ntcctttgca ggatcccatc gattcgaatt 60 

cggcacgaga gcaattccac tcctagctcc acccacaggt aattgaaagc aaagacgcaa 120 

acagatgcct gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc 180 

gtcacagcgg natcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc 240 

ttcgccacag cggcagcatc tgtcgtcaca gnggcagcat ccttcgccaa agcggcagca 3 00 

tccttcgtca tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc 360 

actgaggcgt gcatagacta aacatggcca gtccaggcac tggaatccag gccgtanaac 420 

ggngcccacn gtcaaaagga atgagaccct gatgcactgg gcgacacaga cgggcgacac 4 80 

agacttggag acatcatgct aagtgaaaag ccaggcacac ggagcggacg gggtgatcct 54 0 

gctcacgtga tgtgtcccga atgggcacnt tcagagggga agaanggaga tggcgcttga 600 

cngtgnccgg gacnggggtt gggagcgacc ggttgttggt ttngggtttc tttctngggt 660 

gaaggaaatg tttttgatat tggggccgtt tgggtgatnt ttgcattacc ctttgaatat 72 0 

gcttanaacc cnctagaaat tgnnacactt tttaaatngn ttggaaatt 769 

<210> 601 
<211> 755 
<212> DNA 
<213> Homo sapiens 



<400> 601 

ntgtttaata ctattttcta atacttgctt tcgttctntt tgcangatcc catcgattcn 60 

aattcggcac gaggagacag cagcccccag ggaatgaagc tgatgccaga gtcagacccg 120 

aggaggaaga ggagccactg atggagatgc ggctccggga tgcgcctcag cacttntatg 180 

cagcaactgc tgcagctggn cctcaagtac ctctttatcc ttggtattca gattctggcc 240 

tgtgccttgg cannctncat ccttngnagg catctcatgg tctggaaagt gtttgcccct 3 00 

aagttcatat ttgangctgt gggcttcatt gnnagcancg nnggacttnt nctgggcata 360 

gctttggtga tnagagtgga tggtgctgtn anctnctggt tcangcanct atttctggcc 420 

agcagatgta nnctatatct gtgattactg gcacttggct acagagagtg ctggataaca 480 

gtgtagcctg cctgtacagg tactggatga tctgnaanac aggctcagcn atactcttac 54 0 

tatcatgcaa ccaggggccg gttgacatct aagacttgnt tattctatag ttcnagganc 600 

acaatggaat atgatccctt aactcctgat ttgggatcat ctgaaggacc aaggnnggca 660 

gtcttcgaag tggaataaaa tagccccggc ngtngtgact tgcacctata ttcccagact 720 

tttgggaggc naannttnga aggattgntt gccct 755 



<210> 602 
<211> 773 
<212> DNA 
<213> Homo sapiens 
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<400> 602 

nttgtaatag ctggtttcta aannntngnt ttcaacccct ttgcatgatn ccatcgattc 60 

gagcaaatca agatcttcag gtacagttgg accaggcact ccagcaagcc ttggatccca 120 

atagttaagg caactctttg tttgcagagg tggaagatcg aagggcagca atggaacgtc 180 

agcttatcag tatgaaagtc aagtatcagt cactaaagaa gcaaaatgta tttaacagag 24 0 

aacanatgca gagaatgaag ttacaaattg ccacgttgct acagatgaaa gggtctcaaa 3 00 

ctgaatttga gcagcaggaa cggttgcttg ccatgttgga gcanaagaat ggtgaaataa 360 

aacatctttn aagtgaaatt ngaaatctgg anaaatttaa gaatttatat gacagnatgg 420 

aatctaagcc tttagtcgac tctggtactc tggaanataa cacctattat acagatttac 480 

ttcatatgaa gctggataac tnaaaacaat agaaattgaa ngcactaaan gtgaattgtc 54 0 

atacaagcga aatgaaancn ttatttgana gccngcgggc ttctaacata ttgagcgata 600 

actttttgca aatgaaagat gcccttcngc tttntgaatt gnaaaatatt gaaacctgan 660 

agntnanctt agntgaattg aaacttaaat ttgaaccctg nacnanaccg gttaantgcc 720 

tgttcctgat aaaaanaagc cntnangtgc ttncctgntn gatttanccc ccg 773 

<210> 603 
<211> 784 
<212> DNA 

<213> Homo sapiens 



<400 
tgctttntaa 
gattcgaatt 
canattaggt 
ctgangaacg 
catgcaagca 
aggagtagct 
ggtggaaaaa 
antgcttgaa 
cctggaactg 
acttttccga 
nttcganaag 
anatgggaaa 
ntnttganct 
aant 



> 603 
tagctgtttt 
cggcacgagg 
gacggagcta 
agccggcggc 
tttcttaaag 
gccagtgcng 
tatcgcccaa 
aaaatcttta 
gaaaaacntc 
ttaagaattc 
aaagtgaaaa 
gccgttttcc 
taactgnntt 



taaatncntn 
gggacatcag 
agacttcgag 
ggtgacctgc 
gtacatccat 
gaagtagcgg 
aatgtgtgga 
gaaggnngca 
cactattttg 
ttgagttaaa 
atttttgctc 
ncctttttaa 
angcagcttt 



gctttgcgct 
tgatcgtaag 
accatctcgt 
acgagaagcc 
cagnactaaa 
agagaacaag 
tgaagttgct 
natcttccta 
gcagcaaact 
tgcatctgat 
aattaanctn 
gantgggaat 
taaaaaanta 



cnntttgcag 
tctcctgggn 
cctttntgta 
aggctaactg 
cccccgctga 
aaagccaaac 
ttccaggaan 
atctcttgtc 
tgagaactct 
gaacctggaa 
gtgtcaagga 
tcttngatga 
ccattggata 



gcatcccatc 
ccgttattct 
tcgcggaaac 
ggtgaagtac 
ccaaggatcg 
ccgttccctg 
aagtggttgc 
ttacggacca 
ttgggcctga 
tacaanttag 
aaatngnttc 
ngncnaattc 
aangagtccn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
784 



<210> 604 

<211> 801 

<212> DNA 

<213> Homo sapiens 



<400 
gttncnnctn 
atncggcccc 
tatagcgcct 
ggctctcgtg 
acttgggcca 
nggaggagtt 
taaaagcana 
ccctancggg 
gtgtgccanc 



> 604 
aacccttttt 
gtgtggggag 
cccctctccc 
ctctaggacc 
gagcaaanga 
ttagggcttg 
naacacataa 
actccaangn 
gcgtgggctg 



tgaaatcnnt 
acngacagca 
ctcagagggc 
tgcacgggac 
aaacctcttg 
gggagaggga 
ccttacaact 
agccacacgc 
gcgtgtgaan 



ngcttctact 
ccctttttnt 
acagctgcag 
ttgtggatgg 
ttttaaaaaa 
ggacacatnt 
gcctggcaag 
cccttctgga 
atctatnaaa 



ctttggcatn 
ctggcatttg 
gcctgaccaa 
gcctggactc 
attttnttca 
ggaggaaatg 
cccaatatca 
agggtgtgng 
taagtatana 



catnccatcg 
cccttgangc 
ggccacgccc 
tccagaaact 
nagtgttttg 
gccttctttt 
cttgtttggg 
catgtnaant 
tgggngtnta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ntatatgtgt ntaaaataaa ngantggaca tatttggncc tctgngnana nncttngaat 6 00 

ctaagncaag agtnnntctn gaaaaacnaa ananagtnct ntntanannt ttacgtaatn 66 0 

atcaatactn tntccacntn accctnctnn tanntntncc natatantcg antaattcnt 72 0 

cactcntnna ttcctngtna acacnaatna atnnaactat naaatatntn tnctnnntan 780 

tngacatann catncrmncc g 8 01 

<210> 605 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 605 

gnttctaatg tggttcnaan acttgctttn gctcctttgc aggatcccat cgattcgaat 6 0 

tcggcacgag agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa 120 

aggggacttg ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac 18 0 

cctccaagtc tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg 240 

ggcttctatc tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct 300 

ggcgggctgt cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc 360 

ctggccttca cccatnccta gccagcagct cccagtcaga caacagccag aaatgtctcc 420 

agactctgcc cagcctccca ggtagccacc ctcgagacat gacctcagag tctctgtgtc 480 

tcctagaagc ctgacagaga cccccanggc agtgggtggg tggcgggcta gagacccttg 54 0 

cctgtgtccg ggaccctggc gccgntcttc cctcctgtgg atcccttcgc acttacaagt 600 

gttctnaant gggcagacgc ctgggcaccc cttgggccct gcccaancat ggccatngng 66 0 

cangcttttt naacccgcat nggntttcca ngcctggtga atcttgcttt tccanggacn 72 0 

nnttggaacc tttcctncgg ggcggggccc ccnagcnct 7 59 

<210> 606 
<211> 809 
<212> DNA 
<213> Homo sapiens 

<400> 606 

tctncgtnaa tcnnnnnttt aaaagccttt gcttttgctc nctttgcttg atcccatcga 60 

ttcgtgactt tgtacctggt ccaagctgat ggggttttgc tgctgttgac ccaggcagga 12 0 

gtctgactag agaacaaact aaggttgctg caacaaacaa ggacctcttc caagaagggc 180 

tcccaggcct ggcgcagtga ctcatgcctg tgatcccagc acttgggagg ccnaggcggg 240 

tggatcattt gaggccagga gttcgagacc agcttggcca acatgatgag accccgtctc 3 00 

tattaaaaat acaaaaatta nccaggcgtg gtggcgcctg tagtcccaac tactcaggag 36 0 

gttgaggcag gagaattgtt gaacccggga ggcggangtt gcaatgagcc aanatagcac 42 0 

cactgcactg catccttggg tgacagaagc gagactccat cttaaaagaa gggctcctgt 4 80 

gtctacgtca tggtggggct anagagangt cccngcagct gggctgtgtt gagtganngg 54 0 

ctnntctttt naannccagg caatagtttg tcttgactct gtccttttct gngtccacat 600 

gacattttac atntttncnn agtttnctta atttaaagtt gnctaatttt accattatac 660 

attttnaatt ggcatttctt ttaccnatnc tttttgtntg aaaatggtan tntttgaaat 72 0 

cngnatcngt tctaatgngn tntattttna ccnaatgcca atnntacctn ctttgnaana 78 0 

atntattcgt tttcnaagnt tnaacctct 809 

<210> 607 
<211> 788 
<212> DNA 
<213> Homo sapiens 
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<400> 607 

tntttctaat acnagtttnc aagncttgct ttnnnatccc tttgcaggat cccatcgatt 6 0 

cgcaaggccc gaggtgccat cccctctggg aagcagaagc ctggtggcac ccagagtggg 12 0 

tactgttcgg taaagagctc accctctcac agcaccacca gcggcgagac agaccccacc 180 

accatcttcc cctgcaagga gtgtggcaaa gtcttcttca agatcaaaag ccgaaatgca 24 0 

cacatgaaaa ctcacaggca gcaggaggaa caacagaggc aaaaggctca gaaggcggct 3 00 

tttgcagctg agatggcagc cacgattgag aggactacgg ggcccgtggg ggcgccgggg 3 60 

ctgctgcccc tggaccagct gagtctgatc aaacccatca aggatgtgga catcctcgac 42 0 

gacgacgtcg tccaacantt gggaggtgtc atggaagang ctgaanttgt ggacaccgat 480 

cttctcttgg atgatcaaga ttcantcttg cttcatggtg acgcagaact ataaagccct 540 

gtgtncactt atagacagtg aaaacccacg ggtcttcatc tttattaatc nngaaacctt 600 

ggaatgcctg ctttgttttg taaccccttt ttaaaaccta cctgttttta aaaagtggtc 660 

atttttantt nacgntttan aaanaaaaan tcctatttct ttttcctttt nattttaaaa 720 

aaaaattngn tttttgttgg ggggntttgg ggggaattaa aataatttgg cccccaactt 780 

taaaaaat 788 

<210> 608 
<211> 796 
<212> DNA 

<213> Homo sapiens 
<400> 608 

tcttttaatg ctttttncaa gccttgtttn aaatcctttg caggatccca tcgattcgaa 60 

ttcggcacga gactaccccg gctacggttc ccccatgcct ggcagcttgg ccatgggccc 120 

ggtcacgaac aaaacgggcc tggacgcctc gcccctggcc gcagatacct cctactacca 180 

gggggtgtac tcccggccca ttatgaactc ctcttaagaa gacgacggct tcaggcccgg 240 

ctaactctgg caccccggat cgaggacaag tgagagagca agtgggggtc gagactttgg 300 

ggagacggtg ttgcaagaga cgcaagggag aagaaatcat aacaccccca cccnaacacc 3 60 

nncaagacag cagtcttctt cacccgctgc agccgttncg ttccaaacag agggccacac 420 

agaatacccc acgtttttat ataaggagga aaaccggnaa aanaatttaa aagttaaaaa 4 80 

aatanccttt cngttttaca ctactgntgt agactcctgn tttcttcaan cacctgnaga 540 

ttcttgattt ttttgttgtt gatgntctct ccattgcttg tngtttgcnt gggaantttt 600 

atttaaaaaa aaaaaaaatt cttgtgagtn gactttggnt tttaaaccan tgntagattt 66 0 

taacngnacc cttaatgggt tgtacntata tgntttnaaa acatgnnaan aaatatttaa 72 0 

tgtaaaggnn ctgttnntaa atntaaccac ntanagaant tnnaaannnn ttnanccctt 780 

tagaacnatt nntgng 796 

<210> 609 
<211> 790 
<212> DNA 

<213> Homo sapiens 
<400> 609 

gnnntttaaa nacctntttc aatncttggt ttnnaatcnt tttgcaggat cccatcgatt 6 0 

cgcatccagg gagaacctcg gggctgggac acctcctggc cctcaccctg ggtcatgttt 12 0 

acagtcctca gtgccccaca ccggtggccc cctgaggaca cctccaccct gaccttgatt 18 0 

ttcccaaacg ctgcctcttg gtgacagact cagcccaaaa ccccttcctt ctgtctctgg 240 

agacccttga gcttggggaa atatggaggg gtgtgtgtct gcaatcaagg cctctgcagc 300 

tcacggctgg cccggtgggc tgggacttcc gtctgaattt taaatactta gggntcattt 36 0 

tttttctctg gcaacaaagc ttgatgtttt cactgcttta gtttcctgtt tgctggtggg 42 0 

aggggatacg gtctgtgact ctggacttgc tctgggggaa cagttgtcac tgcccccngg 480 
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gagaggggca gcttgggctt ggaagaaagc acaccccnga gaccagagcc ccttcnagag 54 0 

ggatncttgg ctgcttcatt gnctttcccc cagcaagccc tgctcttcca caagcncctt 600 

ntggggtctt gggtatggtc ccccgntcac cttctttcca nantccctga nntggtgtag 660 

ggttgtgggt tggcacangg aattttgggg cattggggaa ggggntttca aaacttttnc 72 0 

caaanacccc cgtgttcctn ngnaaaattn aanttggtgg gcttngggtg ntnaccccca 78 0 

antcttngnc 790 

<210> 610 
<211> 786 
<212> DNA 

<213> Homo sapiens 
<400> 610 

gatgtttnnn annctggttc taatncttgg aaanctncnn ctttgttann ngcnntttct 60 

gcaggatccc atcgattcga attcggcacg agcccagctg gacctggtgg ccctttccta 12 0 

gtgcctctgc tgggggagga gaacctctgt ccacgtggag gctaggaggt ctcaggtgct 18 0 

gccctggcag caccagagtg tgggccgggc ccgagtgtct gcccctcggc cctcagggtg 24 0 

gggcacttag cacccagaag ggaccaaaag cagggcatgg cggtgcagag gagtttggga 3 00 

ggtgtaaaca gccccatgca cgtggaggag gagctggctt tcagccccag accccacgct 3 60 

agcactttcc acgctgcttg cccgctgttg atgtgcagtt cccagtgcct gtgtgagccg 42 0 

acatctgctc agtcctatcc ctcgtcagcg tgtggagacc cagctcctgc aagcccttct 4 80 

gcttccacgc ccccagacag cttggtggag ggtcctgcat ctgggccaag ctggggtgca 54 0 

cccagccaaa gacaaagctg ccttcacgtg cccaaaggat tcaagatggt gcactggccc 6 00 

cgggaggagt cttgaccaaa aatgggagcc cgctcttgtg gggaaanccc cgacttcccc 660 

caccnanaaa ccgntcccac ggtgccggan cttccccctt ttcctttgtg ggggcaacaa 72 0 

nattggcctt gggcnctttc aattnttncg gaagctttcc tgggtgtngg cttttgacct 78 0 

taaaat 786 

<210> 611 
<211> 938 
<212> DNA 
<213 > Homo sapiens 

<400> 611 

tgttttaaag ccctntttng aatncntggc ttncgncccc ttggcaagat cnctctctgc 6 0 

aggatcccat cgattcgttt gtatttttag tagagacagg gtttcttcat gttggtcagg 12 0 

ctggtctnaa actcctaacc tcgtgatccg cctgcctcga cctcccaaag tgctgggatt 18 0 

acaggcatga gccaccatgc ccagccaaag atcatttttt tatatagact tcagnccttt 240 

gtaaatattg taactgggga gtatagagta gaaaaaaagt atagntaaaa catttgttct 3 00 

acaaattaac ctttaaaaat ataattactg ctaaaaatag agtgctgtta cacttaagga 360 

aaattagtgc cattttggaa atgagatctt gtgccataaa tncagctgaa ctgaatataa 420 

atgttcacaa attaatgctg tnaaaggaat gagttaagca gaaaaacttt taaccagcac 4 80 

ttctcaaaaa anaaaannna nnaattaaat nntataancn ncatnnanat ntatnntann 54 0 

tttncntctn nattncanta attttgtntt ncaaatantt nnacctnnan ctntgttntn 6 00 

nttnnnncna tnnantatcn ntttatcnan tatatnatta nctnattntn nngnanngna 66 0 

tcntnctcta tncnnnatnn tncatatnnc gtccnntnnn nnaantatgc ctcatnatat 72 0 

ntacnnnaaa ngtntangta tgnttantgc atnnncatna ctnntgatgt cnnagtnnna 7 80 

natattttgc cnctcattat tntgctnatn tatntgtttg acacannata ctnnnancna 84 0 

ttcatcttct cgcaatnngn gnacttttna nttacnnnna tgntannnnt natatatnta 9 00 

tcattagana ccttttnaat tntnnntncn nanacgcg 938 
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<210> 612 
<211> 771 
<212> DNA 
<213> Homo sapiens 



<400> 612 

tgtttgnaan nncggntntt gaaatncntg gtacnnaaac nctttngnaa ancncccctc 6 0 

nctgtntgat cccatcgatt cgaattcggc acgagataga aactaggcac tgatttgttt 12 0 

atatttntcc tgctcgagac acatgatgtt tcatgtatct gtggcttttt atagtttaaa 180 

ataatttctg gaaaagtcat agtcattatc tctttaaccg ctccctctct tccattctct 240 

ttgttctctc ttcctcgaac tcctgttagt catttgatcc tccatatctc tgaatatttt 300 

tgtatttctt ttattattta tttcttgtct ctgctacatt ttacattgag taaaagtggg 360 

atgtgacagt gggaaatcat tagtgactta gaaattccag ttggtcattg ggccaatttt 420 

gatgctacct tctctctttt atttctcact tcaaaataaa atttgcaaaa acaaaaaatt 480 

aaatatagta tgagtccagt tactggccta aggagctaaa agcattctgg gtttgtatga 540 

agacagctga gttataacaa atgagagtac tgttgtgtga ctgcattaat tattcccttt 600 

ttaaatgtac aagagcaang cattctacct gactgngtta ttgagctctg cancatacat 66 0 

ggtgacanag ctaaaacaan acaagccnaa ccnanaagga aaaccccagc tttagggata 72 0 

ctctgntcat ngaatatagc ctgaaaaatg gntaatcaag aaagtnaacn t 771 



<210> 613 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 613 

tttgaatcct tgctttcaaa tncttggcac tngccctctc tgnaggaatc ccatcgattc 60 

gaattcggca cgaggtaacg tgacacgtat tttacttctt ttantaggcg gacacacttt 12 0 

cttaaagtaa taatacgtca tggccctgct ataaggtagt agttctagaa gactgtntat 18 0 

ctaataattc agactaaagc tatttatatt gctgtgacac cacgtggaaa acttttataa 240 

ttccatctta tttctgatgt atatgtttta ttttctctgc cttcataaga actaaaaacc 300 

aaagttattt acgtgaaaac aagatttttg tttgagttca tttacttgag atatgtttaa 360 

aaaatccacc ttctgtcaca ctatagaagt atattttgaa ttatcaaaag gtagaattat 420 

aactttcana aaagaaaaaa atggtcaatt tantttaact ctatgtcaaa aatttattta 480 

tagtctcata tattcattcc acaccccccg ttcttctttc cttctttctc cctctgcctt 540 

nttcttaatn atnattttta aattctgacc aaaaataaag tngtggcaag tactttctta 600 

gcataacctg gactggttga agnagtaatt ctgntccttt aaaaaaantc cccaactggg 660 

ncccnggnca ggnacaaaaa nttntaanga acatntggga attangcnaa atggatnttc 72 0 

cttggaggtc caacccccaa aaatcattag gncnaccaaa attnaaaata atcg 774 



<210> 614 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 614 

ttggantctt ctcngaaacn cttngcnatt gcnctntctg naggatccca tcgattcgaa 6 0 

ttcggcacga ggttcttcaa agccaaccaa gacaggcttn tnagttttag agcttcagaa 120 

caaattgcca aaagccagag ttgtttatgc tagtgcaact ggtgcttctg aaccacgcaa 180 

catggcctat atgaaccgtc ttggcatatg gggtgagggt actccattta gagaattcag 240 

tgattttatt caagcagtag aacggagagg agttggtgcc atggaaatag ttgctatgga 300 
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tatgaagctt 
aattgaggaa 
gtgggtcatt 
aatgaagaag 
tgcatagcaa 
ggaaaaatgt 
ctttggaaag 
gtgttgcnct 



agaggaatgt acattgctcg acaactgagc tttactggag tgaccttcaa 
gttcttcttt ctcagagcta cgttaaaatg tataacaaag ctgtcaagct 
gccagagagc ggtttcagca agctgcagat ctgattgatg ctgagcaacg 
tccatgtggg gtcagttctg gtctgctnac cagaggttct tcaaatctta 
tccaaagtta aaagggtttg tgccactagc tcgagaggaa atcaangaat 
gttgtaattg gtctgcantc tacaaggaga agctangaac atttagaaag 
aaggccggng ggagaaattg aatgattttt ggtttcaact nccaaaaggt 
cccttctttg aaaaaacatt ttct 



480 
540 
600 
660 
720 
754 



420 



360 



<210> 615 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 615 

tgtttnaatg ctgttttgaa atcttgtttc aaatcctttg gctacttgct ctntctgnan 6 0 

gatcccatcg attcgaattc ggcacgaggg attctttcac tgagcacaaa gagttgttgg 120 

ggctttagca tctgactgat tttgttacgg ggttgattct gaccatagga agtatgcaat 180 

gtgaatcact atttacagag aaacctacaa cagatgcttg atgttgtaga aactgggaca 240 

tatagatacc aagcaaaatt ataagaaacc tataaggtgt tcaatacgct tgtgtttcca 3 00 

aaattcactg tncatgatca gtttggtgtt cttgtaccac agtttttaac tgaaggaacc 360 

agttgtaaca gtctcaattt ttaactaaaa cttgaagaac taanacaaca atgcaaacct 420 

ttcagcattg tttggccaaa cttgttaaaa ctgtaatgca agaaccaaat gcactgtgat 480 

gtggcaccaa ctaattagca agcatgaatt tttcacccaa nagtgaaaaa aggaaaatct 54 0 

accatggctt naagtttaag agcagaactt cctgactncc attctatgac tgatcaaaaa 600 

nactaatagt ttaaaacctn agcangcctt gttcacgata tgcngaaaaa aaaagtgctt 660 

gcagtttann atccttatgg aantttttca cantgtnaca nggtnttgta atacnttgga 72 0 

ngccctacat tttcntanga atntattttn cttggcctaa nttggnntca angc 774 

<210> 616 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 616 

atnncntttt tgnaatcctc tctgaaatcc tttgctactt gctctttntg caggatccca 6 0 

tcgattcggc cagtcctcac cttccctagt cctcgtgtgt attttaggag atgcgtgggt 12 0 

gtggaacagc ctcctgcctc cggtccaggt gtactggggt ctgtgtgttg tgtttctgcg 180 

tgttctcggc agaaagtggc atgctgtccc gcctgggtga tttgctcttt tacactattg 240 

ctgaaggaca ggaacgaatc cctatccaca agttcaccac tgcactaaag gccactggac 3 00 

tgcagacatc agatcctcgg ctccgagact gcatgagcga gatgcaccgc gtggtccaag 36 0 

agtccagtag tggtggcctc ttggaccgag atctcttccg aaagtgtgtg agcagcaaca 42 0 

ttgtgctcct gacccaagca ttccgaaaga agtttgtcat tcctgatttt gaggagttca 480 

cgggccatgt ggatcgcatc tttgaggatg tcaaanagct tactggaggc aaagtggcan 54 0 

cctacatccc cttnctggcc aagtcaaacc cagacctgtn gggtgtctnc ctgtgcactg 600 

gtggatngtc aanngcactc ttgtgggcca cacaanagat tccctttttg cctgcaanac 66 0 

cntgtntgaa acccccttaa cttatngccn atttncntna agcaaccctt aggcnanttg 72 0 

actnncnttc acaanttttt ggggcnaaag anncnaattg gcctgccct 76 9 



<210> 617 
<211> 766 
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<212> DNA 

<213> Homo sapiens 



<400> 617 

aganntcttc ctttctaatn nctngctacn ttctctntct gcaggnatcc catcgattcg 60 

cttcctcaaa gcatggttgc tgagnaccca nagttgcgag gngttttttt actgatttag 12 0 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 180 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 24 0 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 3 00 

tgtgaccgga attatgggac atgctgtgca nactgttgaa actatgaatg aaggggacca 360 

tagagtgagg gagaagctga tgcatttgtt cacgtctgga gactgcaaag catacagccc 420 

agaggatctg gaagagagaa agaacagcct aaagaaatgg cttgagaaga accacatccc 480 

catcactgaa cagggagacg ctccaaagac tctctgtgtn gctggggtnc tgactataga 540 

cccaccatat gggtccacaa naantgcagc atctctaatg aganttattc ttgcccttng 600 

ttcaangatc ttattgaaag gacatcttac agcttttccc aatgagaang cccangaagt 660 

gttaaacata ctgnnttgaa aaaagcactn tatnttntcc cntnttaana tggtntctaa 720 

aatgtanaaa naaannaaaa naaaanctcg atccctctnn aacnct 7 66 



<210> 618 
<211> 762 
<212> DNA 
<213> Homo sapiens 



<400> 618 

tttnnagnnt cttcctttct aatggcttgg ctactngttc tttntgcagg atcccatcga 60 

ttcgctcagt gcagcgatca tggctcagtg cagcctcaaa ctcttgggct caagcagtgc 12 0 

tccaacctca gcctcctgag tagctaggac tataggcaca cagcaccatg ccccggctat 180 

ttttttattt tgtagagatg gggtctcact atgttgccca ggctagtctt gaactcctgg 240 

cctcaagcaa tcctcccacc tcggcctccc aaagtgctgg gattaaaggc gtgagccacc 3 00 

gtacctggcc cttggtggaa tctttagggt tttctattca tacatataaa atcatatcat 3 60 

tggcaaacag agataatttt acttcctcct ttccaatttg gatgccttag atttcttttc 420 

cttgcctaac tgctctgtct agaactccca gcactatgct gaatagagtg gcaagagcag 4 80 

gcatttgcct tgttcctaac cttagagaaa aatccttcag ccttttacca ttgaggatga 540 

tgtttgctgt tagtttttca taaatgatct atatcaggct tgaataaatt tctatttcta 600 

aaaanaaaaa atataacnnn ntanttnatn aantnnttaa naaaanaaaa actggnacct 660 

ntaaaactta tagtngagtc gtttnaccgt anatcccana ntttgataan gatacattgg 72 0 

atnanttttg gacaanccnc aactaggaat ngcnntgnaa at 762 



<210> 619 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 619 

tttggagntc tttctttcta atncttggct actngntctt tntgcaggat cccatcgatt 6 0 

cgaattcggc acgagcggac ccatcggagc gtaacctgga tctccgcagg cc tggcggag 12 0 

gccggccacc tggaggggca ttgcttggtt cgcgtggtag cagaggagct tgagaatgtt 18 0 

cgcatcttac cacatacagt tctttacatg gctgattcag aaactttcat tagtctggaa 240 

gagtgtcgtg gccataagag agcaaggaaa agaactagta tggaaacagc acttgccctt 300 

gagaagctat tccccaaaca atgccaagtc cttgggattg tgaccccagg aattgtagtg 360 

actccaatgg gatcaggtag caatcgacct catgaaatag aaattggaga atctggtttt 420 
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gctttattat tccctcaaat tgaaggaatn aaaatacaac cctttcattt tattaaggat 
ccaaagaatt taacattaga aagacatcaa cttcactgaa gtaggtcttt tagataaccc 
ctgaacttcg tgtggtccct tgtctttggn tataaatgct gtaaggtggn agccantaat 
tntctgcaan aagtangnca gcacttttca gtgatttgaa tatcatcttg gcttngangc 
cangtggaca accttgtcat aactgacttc tgaaaagaac cctntngata tttgatgcct 
cnggtgtngg tggaactgtc atttantngg anna 



480 
540 
600 
660 
720 
754 



<210> 620 

<211> 767 

<212> DNA 

<213> Homo sapiens 

<400> 620 

gcgttctttg aaagccctnt tttgaaaggc ttgcttctaa ttacgggaaa cctttgcaac 60 

tgcagatccc atcgattcga attcggcacg aggacccagg tagaccagct caagagttca 12 0 

tgttctttgt natcctcctg tgagctctct gtaagtcnnt ttcttgccca tcaccacatc 180 

cctagtactg ggtatcagtc tggccacttg gctttctggt ttgccccaat gtggtctatt 240 

cttgatgcag ctaccaaagt aatgttttaa aaccattata ccaagttact atccttgtca 300 



<210> 621 
<211> 828 
<212> DNA 
<213> Homo sapiens 

<400> 621 

tttctaatag cttgctttct aatnctnggn aacgctnggt ctctgnagga tccctcgatt 60 

cgaattcggc ncgaggggtg acagagtgaa actcgtatct ccaancaaac aaacaaaaag 120 

tncttaaaca tatgtgaaca aaaatttngt gatggaagga ttctagttaa tgagtattgc 18 0 

atcaagattt acatctttct tactaaggaa aagagttaat aaaaatngnt ctttatttta 240 

caggcagnta ctgaggctct tcccanntcn cagtanacag ccactcagcc ttgaaaatgg 300 

agtgttgttg tttctaaaca tatatttatg tcatttattn aggtacagtt cacttaaata 360 

accataagtn gantctctct tgtnagtgat ttgggtagga agaggccatg tctanagttc 420 

natttctctg ttgggtccna ntgaaattgg accttttnag ttgttcanaa aaatnaanat 480 

aaattnctca tattaaatca agannctcnt caanttatag atgtggggta gggttccnng 54 0 

taaaacccat tatnaatcta gaaaattatc nctatngana angcntttaa tatctnttac 600 

cntgaaattc attactttag tncaaggcct acctttaaan gtttnnncnn gaaccatttt 660 

tannnnttcn ncttttgnnc caananntca ttttaancca ccaaaantcn caattnttnt 720 

tncattnnaa tannggatgn naattatnnn atcnatgtgt catatttnac canganaata 780 

ctgngctncn tgnaataatn ggtacactaa anncnngann tttnntcn 82 8 



aaacccccag taactgccaa 

ataaactggc cactgcctat 

ccttccagcg ccgttattct 

agagtaggaa aattactggt 

ttcattgcng aatcttctnc 

nctatnatgc cc tact cat t 

attgctaacn tgaaatacac 

nttngcactg gnnaggcnaa 



tctcacttag aataaaatcc ggactcctgt gaagcacagc 
gcagcaacct catctttacc gnttcctgcc ttgctcactc 
tcctgatgcc cctagtacac aacaactcct tcctgctcca 
ctctctgcca gngagaancc tcttctggna ttacctttgc 
aatatcatct tctaaaaaga gccttttaaa aatcaccttt 
tccagtccct gaaanggcca ttcccacttn antannactt 
taaatgnnan ccttcatgaa nggtanggca anttaaatgc 
gagaacaagc ancntggntt canaagn 



420 
480 
540 
600 
660 
720 
767 



360 



<210> 622 
<211> 784 
<212> DNA 
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